rrATrCGCACCGCCCGGGTGGGACGTGTVCAAACTTTTCGGT^^
?AA^KAGCa:CGGTnAnCGCrn6AGAATCAGTAT/^^^^ 

iSTGTAAnAGGnGCGTGCTTGAACm/^^ 
TTATCT^CCmATCGnACAGTAAATCATmGATGMCTm^ 

S,t]SgAAGTACGCAGC6CTCTAACAAAATGT^^^ 
?ACMGTTGCCCGGC™mmGGTGATTGAAGCATGAWTC^^ 

rAC^GGMGTGATCGAGATAGATGACACCGMGAG^^^^^^ 
CCTGATGGATCTGATGAAAACGGTGATGCTGCATCT^^^^ 
GMGAGGAAGCGCGTAAAAAGGATGAGGGGGCTTCCG^^^ 
iTrATCTGAGATTGAACAAGACGATGATGGCGATGTTATGGAAATCACTGAG 

^^lrrCGMCGG 

GAGAMGAGCCAGAAGATTCTTCAATGCCTGTCGAACAGA^^^ 



GTGAAGCGGCCTGTCGAATGCATCGAACTCGACGACGACGATGMG^^^^^ 
IGAAATTTCTACC 

BCGACAAGCGTl ^^.^.^A^AA/^z-An-^sA a^s^ 

GAGATTGCTCGACAAGCTGGAAGAGTATGTGAAGGAGCAC^^^^ 



GATTCAGGAAATTTCTACCCCTGCCCCAGCTAAAA^GCTAA^ 
GTCAAG 



GCGACAAGCGTTCCAGAAGAGGACAACAATGAGCAGGCGCAGAA 



ATrrAGCAAAAGCCGAAAAGTTCTGGACACTCTTCTCGGAGCAATCAATGC 
SnCAAMGGAGCCTCTGTCGGTTCGGAAGCTGA^^ 
?GT^G-aCTCCCAAACACAATATCATTCCCACCAAGTCM^^ 
I?AT?rAGCACGATCCCGAAATGCCTTTGACGAAGGTTATCAACAGGATGTT 

^gSg^gac^^^^ 



gScatcgatgcgacgtctgtggattccagacgg^^^^ 

?^^GA6CACTCACAAGGAGAATTTGCACTTCACAGGATCC^^ 

ISatgtgtaaagagacggacacgagtgagcaaag/^t^^^^^ 

^TrTMGTTTTTTTTTTTTCATCTTTCAATATTCATTTAATTACAGCGAAACTC 
MCTOTTATTGCAAAATGGGAAGAGAAGGAGTCCAAGTATCCATGTGCAAT 
GCGCMCTOTACGGAAGCM^
GCTCGTCnCGTGAGCMCAGCAAb^^^ 

AGTATTTT^TATMyAn^^^ 
GTGGTCAACMGGAM-nCGCTG^^^ 

CACTTGGIAGCTG^^^^^^ 

S-SSS^CGAG^^^^ 



TCTTATTCTTTACATTCAA- 

tatacacattctattgcgc 

GATTTTT6GAAGATATCTAT 



TATACACATTCTAnGCGG^^^^^ 
Ttggtcaccgccgacgagacggtactcgccacaacgaccaacaccacttcc

ATfiTCTGTGGAACCAACGGATCCGAGAAGCGCTGGTGAATCGTCCTCAGAT 

TrGGAGCCAGACAGAATTGAGCAGCTGAAGGCAGAACAGCGCGAAGTGAT 

rrcCGACGCGGCGAATGGTTCCGAAGTCAACGGAAATCAAGAGAACGGAA 

AAGAGGAAGCGGCATCTGCAGACGTGGAAGTGATCGAGATAGATGACACC 

rAAGAGTCTACGGATCCCTCACCTGATGGATCTGATGAAAACGGTGATGCT 

rrATCTACATCGGTTCCAATCGAAGAGGAAGCGCGTAAAAAGGATGAGGGG 

rCTTCCGAAGTGACTGTGGCATCATCTGAGATTGAACAAGACGATGATGGC 

rATGTTATGGAAATCACTGAGGAGCCGAACGGAAAGTCGGAGGATACTGCC 

AACGGAACAGTTACTGAGGAGGTGCTAGATGAAGAGGAGCCAGAACCTTCC 

rTAAACGGAACAACTGAGATCGCTACAGAGAAAGAGCCAGAAGATTCTTCA 

ATGCCTGTCGAACAGAA7GGGAAGGGTGTGAAGCGGCCTGTCGAATGCAT 

rGAACTCGACGACGACGATGATGACGAGATTCAGGAAATTTCTACCGCTGC 

rCCAGCTAAAAAAGCTAAAATTGATGATGTCAAGGCGACAAGCGTTCCAGA 

AGAGGACAACAATGAGCAGGCGCAGAAGAGATTGCTCGACAAGCTGGAAG 

AGTATGTGAAGGAGCAGAAGGATCAACCATCCAGCAAAAGCCGAAAAGTTC 

TGGACACTCTTCTCGGAGCAATCAATGCGCAAGTTCAAAAGGAGCCTCTGT 

rGGTTCGGAAGCTGATCCTGGACAAAGTTCTCGTTCTCCCAAACACAATATC 

ATrCCCACCAAGTCAAGTTTGCGACTTATTGATTGAGCACGATCCCGAAATG 

rCTTTGACGAAGGTTATCAACAGGATGTTTGGAGAAGAAAGACCAAAGTTGA- 

rTGATTCCGAGAAACGAGAGAGAGCTCAGCTGAAACAACATAATCCTGTTC 

rAAATATGACAAAACTGCTCGTGGACATTGGACAGGATCTCGTTCAAGAAG 

rTACCTATTGTGATATAGTTCACGCGAAGAATCTTCCAGAGGTGCCAAAAAA 

TCTTGAAACCTATAAGCAAGTCGCTGCGCAGTTGAAACCAGTTTGGGAGAC , 

ATTGAAACGCAAAAATGAGCCGTACAAGTTGAAAATGCATCGATGCGACGT 

rTGTGGATTCCAGACGGAATCAAAGCTGGTTATGAGCACTCACAAGGAGAA 

TTTGCACTTCACAGGATCCAAATTCCAGTGCACCATGTGTAAAGAGACGGAC 

ACGAGTGAGCAAAGAAT6AAGGATCACTACTTCGAAACTCATCTTGTTATTG 

rAAAATCGGAAGAGAAGGAGTCCAAGTATCCATGTGCAATCTGCGAAGAAG 

ACTTCAATTTCAAAGGTGTCCGTGAGCAGCATTACAAGCAGTGCAAGAAGG 

ACTACATTCGCATTCGAAACATCATGATGCCGAAGCAAGACGATCATCTCTA 

TATCAACAGATGGCTCTGGGAGAGGCCCCAATrGGATCCCAGCATTCTTCA 

ACAGCAGCAACAAGCTGCTCTTCAGCAAGCTCAACAAAAGAAGCAACAGCA 

ACrrCTGCATCAACAGCAAGCAGCACAAGCTGCAGCCGCTGCGCAACTCTT 

ACGGAAGCAACAATTACAACAGCAACAACAACAGCAACAGGCTCGTCTTCG 

TGAGCAACAGCAAGCGGCCCAATTCCGGCAAGTGGCTCAACTGCTGCAACA 

ACAATCAGCGCAGGCTCAACGTGCACAGCAGAATCAAGGAAATGTGAATCA 

TAACACTCTGATTGCAGCAATGCAAGCGTCGTTGCGTAGAGGTGGTCAACA 

AGGAAAnCGCTGGCAGTTTCTCAACTTCTCCAAAAGCAAATGGCAGCTTTG 



AGCCAGAACAGTCAAAAGACGCCAA^^ 

rAAATTTGTGATGCGTCAGTGCAGGAAAAGGAGAAGTATCTACAGCATCTTC 
AGACTACTCATAAGCAGATG6TTGGAAAAGTGCTGCAGGACATGTCGCAAG 
GAGCTCCACTGGCATGTTCTCGATGCCGTGACAGATTCTGGACTTATGAAG 
GGTTG6AGCGGCACTTGGTGATGTCGCATGGTCTCGTCACTGCTGATCTGC 
FIGURES



trr-1 genomic sequence

gaggaacatgtagacgacgattcggtttccgtactctcatgacttttggcg 

AAAATCCTCACGAATTCTTTTTCCGTCATACGTT6AGTTAAAAATCTGGCGAT 
rvi^CG/^GAATGAGMGAGCGmGATGTTTG CCATAAGTA GATTTTACTG 
AAATAAGAAAAAGCTTTAATTAAATATAATGATGAl 1 1 1 1 1 1 1 iCCAACTCAGT 
TTTCGCATTGTTCTGATGTTTTTAGTTCTGTGGCTCTGCGAAGGAAAAGTCG 

aataaatgcagcgaaatttcctgttgtttgtgtattgtacattagacattgaa 

gatgatcatctaaagcagattccaaagcgattcgggtgtctctaaacgatta 

taacatttttaaagcttttgcctaattttaatccttactcgtcgtcatcatcaa 

acttgagactgaaagagagaagtttgttccaaaatgggtcataatcgtcgac 

aggttccaaaccgctgagtttcttcagataaatattctcctgtaagaccgtt 

tccttggttataactgatcccatgtgtctgaaatttgnattacactgttaat 

aatcataaaaataaaagaaaaagtcaagaaagggtcaaatattaatcaggtca 

catcttttttattcaataaaatctcctctctcgttcgtggcaatgcacgtgaa 

atgcgccaacaaccgcgagtgcgccaacacacacacatacgcgtcagcag 

ar.aattcgctctcgtttgaaamagttgtttctttg tttct gctgaaataat. 

gtcagttttccgataamcagcgttttctgactgatttttcttgttgcattc 

acttcctaatagttcattctactccattcttcattttataatctgtttccttcg 

caatttagtgaattaaacacgtaaatcttgtttcagataaattattcaaatagt 

tgcacaaagctcaatagtttagaagtatcttcagtgctggtcactaatacaa 

a^gatccggctatggcttctccaggctatcggtctgtgcagtccgatcg 

^^aatcacctaacagagctggaaacgagaattcaaaatcttgccgataat 

tcacaaagagatgatgtcaaattgaaaatgttacaagttagtttcaataattc 

gtgttaagtaatcaatttgttcggttgcaggagatttggagcacaatcgaaa 

atcatttcacactaagttcgcacgagaaagtcgtggagaggctcattctctc 

gttcctacaagttttctgcaacacaagtccacagttcattgctgaaaacaat 

acacaacagcttcgaaagttaatgcttgaaatcattcttcgactttcgaacg 

tagaagccatgaaacatcatagcaaagaaattatcaagcagatgatgaggct 

aatcaccgtggaaaatgaggagaatgccaatttggctatcaaaattgtcacc 

gatcaagggagaagtaccggcaaaatgcaatattgcggagaggtttcacag 

ataatggtctccttcaaaacaatggtcattgatctgacggcgagtggtcga 

gctggtgatatgttcaacataaaagagcataaagctccaccgtcaactagct 

ccgacgagcaagtcatcactgaatatttgaagacttgctactatcaacaaac 

ggttcttctcaacggaacggaaggaaaaccgccattaaaatacaatatgatt 

ccatcagctcatcagtcaacgaaggtgctcctggaggttccgtatctcgtg 

attttcttctatcaacatttcaaaacagcgatccaaaccgaagcgcttgattt 



ctcaaaacaaatcaaataataaccgatgattttgtcagtgcacagtcccgat 

tcctgtcattcgtcaacattatggctaagattccagcggtaagtttcgttttt

tr t aftxtttttttct.gjaatcctgatttttattttt c agtttatggatcttatca 

tgc/^t(5gaccgcnctagtgtcgc3gmcaatgc^^ 

gcccggctgatctgataagtgtccgacgagaagttctgatggctttgaagt 

atttcacatctggagaaatgaagtcgaaattctttccaatgctacctcgact 

catcgctgaggaggttgttctgggaacaggattcactgcgattgagcattt 

gcgagttttcatgtatcaaatgctagcagatctgttgcatcacatgcgaaat 

tctatagactatgaaatgatcacacagtaagtttgaataagactttctgatga 
aaaatgttgaaatttcagcgtgattttcgtattctgtcgcactcttcacgatc

ctaacaactcttctcaagtccagattatgtctgctcggctgctcaactcact

ggccgaatctctgtgcaaaatggattcacatgataccgtaagacttattcta 

tcaataatcgtatctcacttcgaaataagtttcagactcgtgatctgctcatt 

gaaatcctggagtcgcacgtggccaagctcaaaactcttgcagtctatcac

atgcctattctcttccaacaatacggaaccgaaatagactacgaatacaaaa 

gttatgagagagacgccgagaaacctggaatgaatatcccaaaggacacta 

tacgaggagtaccgaaacgaagaatccgtcggctctccattgattcagttg 

aagagctggaattcctggcatcagaaccatccacgtcggaagatgcagatg

agagtggtggagatccgaacaagcttcctccgccaacaaaagagggaaaga 

aaacgtctcccgaagcgattttaaccgccatgtcaacgatgacacctcctc 

cattggcaattgttgaagctcgaaatcttgtgaagtatataatgcatacgtg

taaattcgtgacaggacaattgagaatcgcccggccatcacaggatatgtat 

cattgttcgaaggagcgagatttattcgaacgtcttctacgatatggtgtaa 

tgtgtatggatgtattcgtgcttccaacaactcgaaatcaaccacaaatgca 

ttcttcaatgcggacaaaagatgagaaagatgctctggagtcgttggcaaa 

cgtttttacaacaatcgaccatgcgatattccgggaaatcttcgaaaagtat 

atggatttcttgattgaaagaatttacaatcggaactatccattgcaattgat 

ggtgaacaccttcttggttcgaaatgaagtgccattcttcgcatctacgatg 

ctttcattcttgatgtctcgaatgaaattgctggaagttagcaatgacaaga 

cgatgctatatgtgaagctcttcaaaattatcttctccgccatcggagccaa 

tggctctgggcttcatggagataaaatgctcacttcatacctcccagagatt 

ctcaaacagtcaactgtcttggcattaacagctcgtgaacctctcaactatt 

tccttttgcttcgtgcattgttccgcagtattggtggtggcgctcaggatat 

tttgtatggaaagttcctgcagttactgccaaatcttcttcaattcttgaata 

aattgacggtgagtttcattttttgatatatcggtaatacactaaaaatccag 

aatcttcagtcatgtcaacatcggattcaaatgcgtgagctcttcgtcgagt 

tgtgtttgactgtgccagttcgactcagttcccttctgccatagctaccgct 

tctgatggatccactggtgtgtgcgatgaatgggagtccgaacatagttac 

ACAAGGATTGAGAACATTGGAATTATGTGTGGATAACTTGCAACCTGAATAT 

cttctcgaaaatatgcttcctgtccgtggagctttgatgcaaggcctctgg 

cgtgttgtatcgaaagctccagatacatcatcgatgacagcagcgttcagg 

atcctcggaaagttcggaggagccaatcgaaaacttctgaatcaaccgcaa 

ancttcaagtagccactttaggcgacgtaagtttatttagtttattctcttc 

ctcgttttaagttctaacattgatcctattaacagactgttcagtcgtacatc 

aatatggaattctcgcggatgggactcgatggcaatcacagcattcacctg 

ccactgtccgagttgatgagagtcgttgccgatcagatgagatatccagct 

gatatgatccttaatccaagtcctgcaatgatcccgtcaactcatatgaaga 

aatggtgtatggaattgtcgaaagccgtcttgttagccggacttggatcttc 

aggaagcccaattactccaagtgcaaatcttccgaagattatcaagaaactt 

cttgaagatttt-gatccaaacaatcgtaccactg-aagtataeacatgtccga 

gggaaagtgatcgagagctttttgtgaatgcacttctcgcaatggcttgtaa 

gttcttaagttcttttctctctaatcagatctatattttaaatttttcagacgg 

aatatggaataaagacggtttccggcatgtctatagcaaattctttatcaaa 

gttctccgccagtttgcgttgattggagtactcgaatacattggtggaaatg 

gatggatgcgtcatgcagaagaggaaggtgttctaccattgtgccttgact 
CGTCTGTTATGGTTGATGCTCTGATTATTTGTCTCTCTGAAACATCGTCAAG

cttSt^attgctggtgtcatgtctcttcgtcatatcaatga^^^ 

rTTACACTTCCCGATATTGATCAAATGTCGAAAGTTCCAATGTGCAAATACTT, 

patGGAGAAGGTGTTCAAATTGTGTCACGGGCCTGCTTGGTATGCAAGATC 

TrGTGGAATCAATGCAATTGGATACATGATCGAATCGTTTCCACGAAAATTT 

rTTAKBGACTTTGTGATAGATGTTGTTGATTCGATCATGGAAGTTATTTTGG 

rAACTGTTGAAGAAATATCAAGTGGATCTGCTGATTCTGCATACGATTGTCT 

rAAGAAAATGATGCGAGTCTATTTCATCAAAGAAGAAGGCCAAGAAGAGGA 

rAATCTGACACTCGCGACTATTTTTGTGTCTGCAATCTCTAAGCATTACTTeC 

ArAGTMTGAAAGAGTCAGAGAATTTGCGATTGGTTTAATGGATCATTGTAT 

rrTTCACTCAAGACTTGCACCATCCCTTGATAAGTTCTACTATCGATTCAAG 

rAATTCTTTGAGCCAGAATTAATGCGGGTGCTCACAACAGTTCCAACAATGT 

rATTGGCAGACGCAGGAGGAAGTTTGGATGGAGTTCAAAACTATATGTTCA 

ArTGTCCGGATGGTTTTGATTTCGAAAAAGATATGGACATGTACAAGCGATA 

TTTGTCACATCTGCTGGATATTGCACAAACCGATACATTTACCTTAAACCAAA 

rrAATGCCTTCAAAAAATGCGAGACATGCCCATCGCATTTCCTTCCTCCATT 

rrrAATCACTACACATATTGATTCAATGCGAGCCAGTGCTCTACAGTGTCTT 

rTGATCGCGTATGATCGAATGAAGAAGCAATACATCGACAAGGGAATAGAG 

rTf^GGTGATGAGCATAAGATGATAGAGATCCTCGCACTTCGCAGCTCCAAG 

ATrACAGTTGATCAAGTCTACGAGAGCGATGAATCTTGGAGACGATrGATGA 

rARrrCTATTGAGAGCAGTCACTGACAGAGAAACTCCTGAAATTGCGGAGA 

ArCTTCATCCTTCACTTTTGAAGGTCTCACCAATATCCACAATCATCATCGCA 

ArATTTGGTGCTTCTTACATAAGAAATATTAGTGGAGCAGGAGATGACAGTG 

ATTrAGATCGTCATATTTCGTACAACGATATAATGAAGTTCAAGTGTCTCGTG 

rARCTCAATCCAAAGATTCTGGTCACAAAAATGGCAGTGAATCTCGCAAATC 

AAiaGGTTAAATATAAGATGAGTGACAAGATCTCTAGGATTTTGTCAGTTCC 

rAGTAGCTTCACTGAAGAGGAGCTCGATGATTTCGAAGCGGAGAAGATGAA 

ArGAATTCGAGAGTTGGATATGATTGGTCATACGGTTAAAATGCTTGCTGGA 

TGCCCAGTGACCACATTCACGGAGCAAATTATTGTGGATATCAGTCGTTTTG 

rTGCTGATTTTGAGTATGCTTATTCGCAAGATGTACTTGTAAATTGGATTGAT 

rATGTCACAGTAATCCTCAACAAAAGTCCCAAAGATGTATGGAAGTTCTTCT 

TrTCTCGAGAATCAATTCTAGATCCTGCACGCAGATCCTTTATTCGAAGAAT 

rATAGTCTATCAATCAAGTGGTCCACTGCGACAGGAATTCATGGATACTCCG 

rAATATTTTGAGAAACTCATTGATCTTGACGATGAGGAGAATAAGGATGAAG 

AT^AGAGAAAAATCTGGGATCGTGATATGTTTGCATTTTCGATTGTCGATCG 

TATCTCGAAGAGCTGCCCTGAGTGGCTTATTTCTCCGAATTCCCCAATTCCA 

AGAATTAAGAAGTTGTTCTCCGAAACGGAATTCAATGAGCGATATGTGGTTC 

rAG€ATTGACTGAGGTGAAGAAATTTCAAGAAGAGATCATAGTGAAACGGA 

TrACAGAGCACAAGTACAAGGTTCCGAAGCTGATTCTGAATACCTTCGTGA 

iATATITGAGGTAATTTCMGATAGTTTGTAAAAATTAATTACAAAGAAATATA 

- rrAAMCTGA^CCCCfl;^^ 
TArrrrCTCGAAAAATCCTTCAAAATACCAAAAAATTCGAATTCTCACTTCT 

AATTATTTTTGAATTTTTAAATAATTTTTGAACATTTCTCTATGAAATTCATGTT 

TTGGGCCTATTTCAGGCTATAAAAATTATTTTTCTGATTTTAAATAACTTGCA^ 

ATTTCAGGCTCAACATCTATGACTACGATCTATTCATCGTTATCGCCTCGTGT 

TTCAATGGCAATTTCGTCACCGATCTCTCTTTTCTTCGCGAATATCTTGAAAC 
ArCGATATGATACGGATGAMTTGTTGGCACCGCACCMTAGATGATTC^^^

rrnAACATCAGTCATTGATTCTCATCGTAATTATCTGAGCGATGGAATGGT 

CATTGTTTTCTATCAACTTTGCACATTGTTCGTACAAAACG^^^^^ 

TTGACAATAATAACTGCAAGAAACAAGGTGGACGCCTACGGATCCTGATG^^ 

rTTCGCCTGGCCGTGCCTGACCATGTACAATCATCAAGATCCAACAATGCG 

?TACACTGGATTCTTCTTCTTGGCCAATATTATAGAGCGTTTCACAATTAAT^ 

rf:5ApX\TCGTGCTTCAAGTGTTCCATCM 

rArTAGAGATCAAATCCGGAAAGCCATTGATATATTAACTCCAGCTTTGAGG 
ACACGAATGGAAGATGGACACTTGCAAATATTGAGTCATGTGAAGAAAA^ 

TATCTAAMTGATTTTmTAATGTTAAAAATTTAATTTTAAA^ 
CTCCnTMTAATTCCTGAATmCCAGCCAAATGGTGGTTCGCAA 



rTCTACTATCATGTTCGATTGGAGCTTCTCACGCCTCTTCTGAACGGAGTrC 
MCGAGCACTTGTGATGCCAAATAGTGTTCTGGAAAAGTAAGT^^ 
rTTGTTCGTAAACTCACCCCTTGTAAATATTTAGCTGGCAAACTCGACGTCA 
TrrGGTGGAGATCTGCGAGATGGTCATCAAGTGGGAATTGTTCAGAACGCT 
GMMCAGATCATATTATCAGTGACGAAGAAGCTCTCGAAGTTGACAAGCM 
TTGGAtAAGCTGCGAACAGCTTCATCCACAGATCGTTTCGATTTCGAGGAG 
rrTCATAACAAGAGAGACATGCCTGATGCTCAACGCACGATTATCAAAGAG 

cSgccgatgtgattgtcaatatgcttgtccgattctgtatgac^^^^^^ 

^GMTTCGGGTTCTTCGTCCACTTCTCAAAGTGGGAACCATGGTGTCGAGTT 



rRGAGA^^TTGTCAGCTTCCGAt^ 
rrGAATGATAATGCTCTACGCAATGATATAAGTTCTACGGCCTACGCTAATA 

rTATCCAAAATGCACAACACACTCTGGATATGCTGTGTAATATTATTCCTGTT 

ATGCCAAAAACTAGCTTGATGACTATGATGAGACAACTCCAACGGCCACTCA 

TArAATGTCTCAATAACGGAGCTCAGGTATGTGAAGAACGATGAATAGGGG 

rrrATAAATCACTAATTTCTCTTAGAACTTTAAGATGACTCGTCTTGTCACTC 

AAATTGTCAGTCGGTTACTCGAAAAGACAAATGTTTCGGTTAACGGGCTTGA 

TnAGCTGGAGCAATTGAATCAATACATTTCCCGATTCCTACATGAACATTTT 

rGATCTCTTTTGAAGTAAGTTTTATTTTTGAATTTCCATCTTTCA^ 

rAGTTGCAGAAACTTGAGTGGACCAGTGTTGGGAGTTCTCGGAGCATTTTC 

TrTTTTGCGAACAATTTGTGGACACGAGCCAGCATACTTGGATCATTTGATG 

rrTTCATTTGTAAAAGTGATGGAGAGAGCTGCAAAAGAGCACTTGGCGTAT 

GTTGCGAACTCGCAAGATGGAAATATGGTGAAGAGTAAGTTCTATAAAAAGA 



rTGCATGCAfGGAGCTGGTACGTCCCAGAGTCGATCATATCAGTATGGAGA 

TTMGAGATCAATTGTTGGTGGTATTATCGCGGAGCTGATTATCAAATCGAA 

TrACGATAAGATCATCCAGACGTCAGTGAAGCTTCTCGGAGCAATGATTAG 

rACGCAGGATATGGAATTrACAATTCTCACTGTTCTTCCGCTACTTGTTCGT 

ATCCAATCAATTATTGTGACCAAGTTCAAGAATTGCAAGGATCTGATAGCAG 

ACTATCTTGTTGTGGTTATTACCGTTTTTGAGAACAGCGAATATCGGAACTC 
GGAAGCTGGATCTCGTCTCTGGGAAGGATTCTTCTGGGGACTCAAGAGTAG

rf^ATCCTCAAACCCGGGAGAAATTCTCGATAGTTTGGGAGAAGACTTGGCC 

ArACATGGCAACAGTAGATATTGCTCATCGAATGAAATATATCATGCAAAAT 

rAAGArrGGTCCAAGTTGAAACACGCGTTTTGGTTGAAATTCGCACTTTGGG 

rAATGCTACGAACGATTGCCAAACGGCCAACTGATCCGAATAATAAGAGAA 

AGAAAGTGATACTGTTGAAGTGTGCAACTCCATGGAGAACAATTGAATATGC 

AnrGAAATTGAAGGATCAGCCAATGGAAGTGGAAACTGAAATGAAACGAGA 

AnAGCCAGAAGCGATGGAAGTTGACGAAAAAGACTeGCAAGATGATTCTAA 

rRATGCCGGAGAGCCCAAGGAGAAGGAAAAGCTCACATTGGAATTATTGCT 

TGCTGGACAACAAGAACTTTTGGATGAAGCTTCCAATTATGATTTTGCGGAT 

rCTCTAGATACAGTATCCCAGATTACATTTGCACTTAATGGTAAATrGTTCAA 

AGTTTATGAATATTTTTCTTAAAAATCACAATTTTCAGAGAATCAAGT6ACAA 

rrAAGATGTGGGTAGTGTTGTTCAAATCATTCTGGAGTTCCTTATCACAATC- 

rGAAATCGAAGATTTCACGGCGCTAGTCGTTCCGTTTATGAGCAGTGGAGT 

rrATAATAATTATCAGACGGGTGTACAGGATAGTGTGCTTGCTGTTTGGCTT 

rAAGCTGTTGGTGACGCTGTTCATTTGCCGTCCAGATTGATTGAGGTACGTT 

rT^AAAATGAATGCTGGAAAAAATTCGATTTTTCTGTTTAAAAAAAGTTAAAA 

TTTCCGATTTTTTGAATAGCAAAAAAAAAAGAAAACATTTATTTTGAAAAAAGA 

rTCCTCACCGGAATTTTTTAATAAATAAATTTAAAAAAAGAAAAAAAACTAAA 

AArTTCAATTTTTGAAAATCAAAAAAAAAATTACAGAAACAGACGAGGTAAAA 

AATVlTAAAAAAGTTCTGTAAAAAAAATGGAGAATCACAGTmCGTTGTGTT 

TTCTGAAAAAAATTTGAAAAATTAAAAATTAACGAmTTTGGTTTTTAATTTA 

AAAAAATATACGAAAAAAGACTGAAGAACTTTTmGTCAAAAAAACTTGATT 

TtgaTGAGGGAAAAAGTTCAAAAACTTGGAGAAATCATCGGA^^ 

rATTCAATAAAAATTTCCAAAAAAAAAAATTGAACATTTATGATTTTTGGGTAT 

TTTGAAAAATTGAAAAATTACGCTTAATTTTTAGATTAMP^^ 

ArCAACACTCCTTTTGAAACTTGACACTTTTGAAACGl I M i i M i I I iGCAAT 

AAT^^^TTTCTCATTTCAGTTTATCTCATCAAAACACGAATGCTGGCATACCG 

f^^ATCAGGCTTCTCGAGAATCATATATGGAC^ 

rACGTTACTCCGAGAAATGAAAGTGGCACCAGGTCTCGCTGGAGATATTGA 
rACACTCGAATCTCTTGGAACACTCTACAATGAGATATCAGAGTTTGATCAG 
TTCGCTGCAATCTGGGAACGCCGTGCTGTATTTCCTGATACGATGA6AGCA 
ATGTCAGCTATGCAATTGGGAGATATGGAATTAGCTCAATCTTATCTGGAAA 
AATCAATGAGCAGTACGTATGAAACTCTTGCTCCGACAATCAATCGTAAGTT 
TGGATCAATCGGTTGTACTTCTCACACAAAATAGTATTCCTTTCAGCAAACAA 
rArTTCAAATTCGGAGAAGCATGTTTCTCCGATTATTGACAAAGAATACGAT 
rATTGGATGGAGATGTAGATCACAAATTGCTCGGAGCTTCTTCAGTGGCAAA 
ATGT6GCCGACGTATGCAATGGCAAAGACATGCAACATGTTCGTGGCCTGA 
TCAACGCAGCATCTCACATTCCGGACTGGAATGTGGTCGAGGAGTGTAAAA 

I . ^ ^ ^ ^ ^ — 1 — I- /-» A 1 — TT" A A ' I I A ^ A r^T j I j ^ ^ 



AATTTGATGAGTACTGTTATGGTTAGTTTAAGTCAAAAAGTGATATATAATTA 

TTGTTTAATTTTTCAGCGAATGAATGAAAACTCAAGCCCGACACATATGAAG 

rAACGATGCAAAATTGCAATTCAAGAGTGCACAGAAGCTCATATTAGTCGTT 

rGAGAGCACTTCCGTCAGTTGTTTCATATGGTCATGTCAAGATTCTTCAGGC 

AATGAACTTGGTTCGAGAAATTGAAGAGTCTACAGATATTCGCATTGCTCTG 

CTCGAGGCCCCATCAAACAAAGTGGATCAGGCGTTGATGGGCGATATGAAG 
tcgttgatgaaagtattccgaaatagaacaccaaccacttcggatgatatgg

gattcgtttggacttggtatgattggaggaatcagattcatggaatgatgct

tcaaagattcgaatattgggataaagtaggactcaacgtcgctgcaactgga

aaccagtcaattgttccgattcattcaatggctcaagcacagttggccgtag

ccaaacatgccaagaatcttggattccataatttaacgaaagatctactcaa

caaattagctggattgacagccataccgatgatggatgctcaagataaagtt 

tgcacttacggcaagacacttcgcgatatggcaaacagtgcggctgacgaa

agagtgaaaaatgagctattgtgtgaagcgcttgaagttttggaagatgtgc 

gaattgatgatctacagaaggatcaggttgctgcattgctttatcatcgtgc 

taatattcattcagttcttgatcagtaagttttcaatgccgaaaaaaaattaa 

agttttacaaaaataaatttcagagctgaaaatgctgactacaccttctccgc 

agcctctcaacttgtcgacttgcaaaatagtgtgacaaccactggaatcaag 

ctcatgaaaaattggggccaccatctttacaagagattcttctctacgacag

tttgcaaggaaaccggaaacaacttcggacggcaggctctcgcttgttact 

tcattgcggctcgtgtggataacgatatcaaggcgagaaaaccgattgcca 

agattttgtggctctcgaagcacttgaatgcgtgtggatcacatgaagtgat 

gaatcgggttattaagaagcaacttcattcacttaatctcttcaattggcttt 

actggcttccacaattggttactgatgttcgatataaaccaaattcgaacttt

gttctgattctctgcaaggtaagtmgaaatamaaatattttcagaatttt 

aaatgaaattcatttgcagatggctgctgctcatccacttcaagtattttacc

acattcgggaggcagttagcgttgacgatattgactcggttctcgaagaag 

attacactgatgagcaaatgtcgatggatgtttcggatgaggattgttttgc

agacgatccaccatttgatagaattctgaaaatatgtctgaaatatcgtccaa 

ctgatattcgagtcttccatcgtgtcctcaaagaacttgacgagatgaatga 

gacatgggttgaacgtcacttgcgtcatgcgatctgcctcaaggatcagat

gttcaaagatttctcggaacaaatggacgcgacgttcaatgagatgcaatat 

tcggaggatgtgactatgatgacgttgagatggaggaaacagctggaagaa

gacttggtgtatttccaagagaattataatcttgatttcctggagattcgtaa 

caagcgaaagatgatcgtgacgaagggatgtatgggagtcgagaaaagtca 

GATAATGTTCGAAAAAGAGCTGAGTCAAGTGTTCACAGAGCCGGCCGGCAT 

gcaagatgaatttgattttgtcacaaatatgactaatatgatggtctcacagt 

tggatattcatgcagtcgatgctccacgccctcagggatatattcgtattgt 

tctcgactggattcgagcgattcgtcgtcgtttcgatcgacttccacgaag 

aatccctctggaatcgtcaagcccatatctcgccagattcagccatcgtaca 

ggatgcatcgaaatgccatacgatttgctcaacgttttgcgcgccaagaat 

catactctgatggcttccaatcaaacggggcaatacatatccatgctctctc 

gatttgagccaaactttgagattgtgatcaaaggtggtcaagtgataagaaa 

gatctatattcgaggacaaaccggaaagagtgcggcgttttatctgaagaa 

atctgtgcaggatgagccaactaaccgagttccacaaatgttcaaacatctt 

gatcacgttctacaaaccgatagagagtcggcgagaagacatcttcatgct

ccaacagtgctgcagatgagagtcggacagaagacgacactctacgaagtt

GCATCCGTTCAACCATATGCAATGCCACCGGATTGTACCAGAAACTATCCAG 
CATCACAAATCGACATTGTTCATCCATATGATGTGCTGACTGCCACTTTGAAT 
GGAAGTTATTATCCGGATGATATGGTATTGCACTTCTTTGAGAGATTCGCCC 
AAAGTTCTTCATCCATCGGACAACCTCTTCCAACTCCGACGAACCAAGATGG 
AACAGTTGCTCCGCCACGACTAACGGAAGCTCACCACATCAAGAATATTATT 
TATGAGTACGTTTGAGAAGCTAGTGTCTAAAATAATAAnAATGTAAAAAAAT

TTTCAGAGACTTTGCCGGAGATATGATCCCATTCCGACTTCTCTAGGACTAC 

rTCACTGCACGATATCCTGATCCGGTTATGTACTATGCAATGAAGAAGCAAT . 

TGCTGCACAGTCTCGCCGTCCTATCCACAATCGAATATCATTGCAATCTGAC 

ACCAATGGGACCT6ATCAAATGAT6AT6ACAATGAATACTGGAGTCCTTAGC 

AATCCTTCATATAGATTCGAAATCCGAGGAGGACGATCACTTCATGATATTC 

AACACTTTGGACATGAAGTTCCATTCCGATTGAGTCCAAATCTATCGATTTTG 

GTTGGTGTTGCACAGGATGGTGACTTGTTATGGAGTATGGCTGCTGCGTCA 

aAatgtttgatgaagaaggaacctgaagttatcatgagaccgttagtatggg 

ATGAATTCGCCAACAATACAGATTGCGACAAATCGGTAATTTTACTTTAATAT 

gctaatagggaattgaactaatgttttccaagcgtttgcaggtattcgcgtg 
-tcatgcatcgaattcttacatcaatggtgtcgcgagcaagcttcgaaacacg 
aatagcgccgaggccaaactcagaaaggacgattgtgtgtcgctgatcagt 
rgagccaaggattcggataatctggcccgaatgccacccacctaccacgc 
gtggttcI^tctcataattaccgttctctattttgatcccgcctcccactc 

T^CACAGATCTCTATACATTTGTCAAATGTTTCCAAATCTTTTATCTGCCCATA 

cattcgtttttattgttttgtttcttttctttctttatttgttttctaaacttta 
agatttatgtaaatatttaactgcgctggtatttatgaaaaattcagataaag 
tVitcaagtttaaaaaatcgaaaattcgaagtcggaagttctcttacaggtgt 
■ aqj;^gtaggcacaatggcaataggtacatggaaggcttgcggaaggcaca 
tgggtaggcataagatcgaaaaataagctgatatataaatatagataggtat 
tggttaggcacaaattaggcacgtaggtgtgagctggcaaataggtaggca 
tgacgttcggcaaatcggcaaattgccgatttggcgaaaattttcaaatccg 
gcgatttgccggaaatgtttagagaaattttnataagacagaaaaacttaca 



TATAGCGCCCCCCCCCCCCCCCCCCCGCCGTATTTTTTGGCGTrrCACGCC 
AlTCTGAf1lTtATtTTretGATTtftTfm 

GGATGCTTGGAGAGAAATATGAGGGAGCAAAATAAAGAATGTGGTGAAGTGA 

ATGTCGAATAGATTTTTTGAGGTTATGGTTAAGAAGGGAGGTCCCACGACGT 

ATTGATCCTTGATGGAGTTAAGAAATTATGATGTTTTAATTGATTTCATTCCAC 

TTCTGGACACAGAAGGACGAATAGTGCAATCTGGTACAAGTTTATCACCACC 

TACAACTTCGTGGATTTGTGGAAAATGTTTGAGAGATGTGTGGATGAGTGTC 

TCAGAACATCTTGGTCAGGTTTGGAGTCGATCCCAGGGCTGGGAGCCGAGA 

ATGGGCCTCTAAGAC 
trr-1 cDNA sequence

ATGGATCCGGCTATGGCTTCTCCAGGCTATCGGTCTGTGCAGTCCGATCGG
AGTAATCACCTAACAGAGCTGGAAACGAGAATTCAAAATCTTGCCGATAATT 
CACAAAGAGATGATGTCAAATTGAAAATGTTACAAGAGATTTGGAGCACAAT 
CGAAAATCATTTCACACTAAGTTCGCACGAGAAAGTCGTGGAGAGGCTCATT 
CTCTCGTTCCTACAAGTTTTCTGCAACACAAGTCCACAGTTCATTGCTGAAA 
ACAATACACAACAGCTTCGAAAGTTAATGCTTGAAATCATTCTTCGACTTTCG 
AACGTAGAAGCCATGAAACATCATAGCAAAGAAATTATCAAGCAGATGATGA
GGCTAATCACCGTGGAAAATGAGGAGAATGCCAATTTGGCTATCAAAATTGT
CACCGATCAAGGGAGAAGTACCGGCAAAATGCAATATTGCGGAGAGGTTTC 
ACAGATAATGGTCTCCTTCAAAACAATGGTCATTGATCTGACGGCGAGTGGT 
CGAGCTGGTGATATGTTCAACATAAAAGAGCATAAAGCTCCACCGTCAACTA
GCTCCGACGAGCAAGTCATCACTGAATATTTGAAGACTTGCTACTATCAACA 
AACGGTTCTTCTCAACGGAACGGAAGGAAAACCGCCATTAAAATACAATATG
ATTCCATCAGCTCATCAGTCAACGAAGGTGCTCCTGGAGGTTCCGTATCTC 
GTGATTTTCTTCTATCAACATTTCAAAACAGCGATCCAAACCGAAGCGCTTG 
ATTTCATGAGGCTTGGTCTTGATTTTCTAAATGTCAGAGTTCCAGACGAGGA 
TAAACTCAAAACAAATCAAATAATAACCGATGATTTTGTCAGTGCACAGTCCC 
GATTCCTGTCATTCGTCAACATTATGGCTAAGATTCCAGCGTTTATGGATCTT 
ATCATGCAAAATGGACCGCTTCTAGTGTCGGGAACAATGCAGATGCTCGAG 
CGGTGCCCGGCTGATCTGATAAGTGTCCGACGAGAAGTTCTGATGGCTTTG 
AAGTATTTCACATCTGGAGAAATGAAGTCGAAATTCTTTCCAATGCTACCTC 
GACTCATCGCTGAGGAGGTTGTTCTGGGAACAGGATTCACTGCGATTGAGC 
ATTTGCGAGTTTTCATGTATCAAATGCTAGCAGATCTGTTGCATCACATGCG 
AAATTCTATAGACTATGAAATGATCACACACGTGATTTTCGTATTCTGTCGCA
CTCTTCACGATCCTAACAACTCTTCTCAAGTCCAGATTATGTCTGCTCGGCT 
GCTCAACTCACTGGCCGAATCTCTGTGCAAAATGGATTCACATGATACGTTT
CAGACTCGTGATCTGCTCATTGAAATCCTGGAGTCGCACGTGGCCAAGCTC 
AAAACTCTTGCAGTCTATCACATGCCTATTCTCTTCCAACAATACGGAACCG 
AAATAGACTACGAATACAAAAGTTATGAGAGAGACGCCGAGAAACCTGGAA
TGAATATCCCAAAGGACACTATACGAGGAGTACCGAAACGAAGAATCCGTC
GGCTCTCCATTGATTCAGTTGAAGAGCTGGAATTCCTGGCATCAGAACCATC 
CACGTCGGAAGATGCAGATGAGAGTGGTGGAGATCCGAACAAGCTTCCTCC 
GCCAACAAAAGAGGGAAAGAAAACGTCTCCCGAAGCGATTTTAACCGCCAT 
GTCAACGATGACACCTCCTCCATTGGCAATTGTTGAAGCTCGAAATCTTGTG
AAGTATATAATGCATACGTGTAAATTCGTGACAGGACAATTGAGAATCGCCC 
GGCCATCACAGGATATGTATCATTGTTCGAAGGAGCGAGATTTATTCGAACG
TCTTCTACGATATGGTGTAATGTGTATGGATGTATTCGTGCTTCCAACAACT 
CGAAATCAACCACAAATGCATTCTTCAATGCGGACAAAAGATGAGAAAGATG



GGAAATCTTCGAAAAGTATATGGATTTCTTGATTGAAAGAATTTACAATCGGA

ACTATCCATTGCAATTGATGGTGAACACCTTCTTGGTTCGAAATGAAGTGCC 

ATTCTTCGCATCTACGATGCTTTCATTCTTGATGTCTCGAATGAAATTGCTGG 

AAGTTAGCAATGACAAGACGATGCTATATGTGAAGCTCTTCAAAATTATCTTC 

TCCGCCATCGGAGCCAATGGCTCTGGGCTTCATGGAGATAAAATGCTCACT 

TCATACCTCCCAGAGATTCTCAAACAGTCAACTGTCTTGGCATTAACAGCTC 
GTGAACCTCTCAACTATTTCCTTTTGCTTCGTGCATTGTTCCGCAGTATTGGT



C/CJ>^TGCGTGAGCTCtTeGTCGAGTTGTGTTTqACTGTGCCAGTTC^ 



r^S^TMCTTGCAACCTGAATATCTTCTCGAAAATATGCmC^^^^^ 
rlGCTTTGATGCAAGGCCTCTGGCGTGTTGTATCGAAAGC^^^^^^^ 

r^^TcTTGGA^^^ 

S??fATCMG/^CT^ 
?f^ZcIcMGTCCGAGGG^ 

r^TCTTTATCAAAGTTCTCCGCCAGTTTGCGTTGATTGGA^^^^^ 
?^^TTGGTGGAAATGGATGGATGCGTCATGCAGAAGAGGMG^^^^^ 
JrATTGTGCCTTGACTCGTCTGTTATGGTTGATGCTCTGATTAmGTCTCTC 
^rAlIcATCGTCMGCTTCATCATTGCTGGTGTCATGTCTCTTCGTC 



C'AATGTGCAAATACTTGATGGAGAAGGTGTTCAAATTG^^ 

JGTATGCAAGATCTGGTGGAATCAATGCAATTGGATACATGATCGAATC 



MTGAGACTCTCTCGCTTACACTTCCCGATA^^^ 

caatc 

STnCCA^SXXXXmGfrATVG^^ 
?rrAAGnAmTGGGAACTGTTGAA 

^rr^ACGATTGTCTCAAGAAAATGATGCGAGTCTATTTCATCAAAGAAGAA 
GGCCM^^^^ 

r?AAGCATTACTTCCACAGTAATGAAAGAGTCAGAGAATTTGCGAT^ 

aI^GATCATTGTATGGTTCACTCAAGACTTGCACCATCCCTTGATAAGTTC 

STCGATTCAAGGAGTTCmGAGCCAGAATTAATG^^ 

cagttccaacaatgtcattggcagacgcaggagg^^^^^ 

ftAAACTATATGTTCAACTGTCCGGATGGTTTTGATTTCGAAAAAGATATGGA 

^tISgatatttgtcacatctgctggatattgcac^^^ 

mACCTTAAACCAAAGGAATGCCTTCAAAAAATGCGAGACA^^^ 

1-Stccttcctccattcccaatcactacacatattgattcaatgcgagccagt 

f^l7cTACAG^^ 

ArMGGGMTAGAGCTGGGTGATGAGCATAAGATGATAGAGATCCTCGCAC 
??^^CAGCTCCAAGATCACAGTTGATCAAGTCTACGAGAGCGATGAATCTTG. 

?AGACGATTGATGACAGTTCW^ 

JplAATTGCGGAGAAGCTTCATCCTTCACTTTTGAAGGTCTC^^ 

IrMTGATCATCGCAACATTTGGTGCTTCTTACATAAGAAATATTAGTGGAG 

rAGGAGATGACAGTGATTCAGATCGTCATATTTCGTACAACGATATAATGAA 

rTlCAAGTGTCTCGTGGAGCTCAATCCAAAGATTCTGGTCACAAAAATGGCA 

GTGAATCTCGCAAATCAAATGGTTAAATATAAGATGAGTGACAAGATCTCTA 
GGATTTTGTCAGTTCCCAGTAGCTTCACTGAAGAGGAGCTCGATGATTTCGA

AGCGGAGAAGATGAAAGGAATTCGAGAGTTGGATATGATTGGTCATACGGT

TAAAATGCTTGCTGGATGCCCAGTGACCACATTCACGGAGCAAATTATTGTG

rAJATCAGTCGTTTTGCTGCTCATTTTGAGTATGGTTATTCGCAAGATGTACT 

TGTAAATTGGATTGATGATGTCACAGTAATCCTCAACAAAAGTCCCAAAGAT 

GTATGGAAGTTCTTCTTGTCTCGAGAATCAATTCTAGATCCTGCACGCAGAT

CCTTTATTCGAAGAATCATAGTCTATCAATCAAGTGGTCCACTGCGACAGGA

ATTCATCGATACTCCGGAATATTTTGAGAAACTCATTGATCTTGACGATGAG 

GAGAATAAGGATGAAGATGAGAGAAAAATCTGGGATCGTGATATGTTTGCAT

TTTCGATTGTCGATCGTATCTCGAAGAGCTGCCCTGAGTGGCTTATTTCTCC 

GAATTCCCCAATTCCAAGAATTAAGAAGTTGTTCTCCGAAACGGAATTCAAT

GAGCGATATGTGGTTCGAGCATTGACTGAGGTGAAGAAATTTCAAGAAGAG

ATCATAGTGAAACGGATGACAGAGCACAAGTACAAGGTTCCGAAGCTGATT

rTGWACCTTCCTGAGATATTTGAGGCTCAACATCTATGACTACGATCTATT 

CATCGTTATCGCCTCGTGTTTCAATGGCAATTTCGTCACCGATCTCTCTTTTC

TTCGCGAATATCTTGAAACTGAAGTCATCCCGAAAGTGCCGTTACAATGGCG 

rAGAGAGCTGTTTCTTCGAATTATGCAGAAGTTTGATACGGATCCACAAACT 

r^CTGGAACAAGTATGCAGCATGTGAAGGCCCTTCAATATTTGGTTATTCCCA 

rGTTGCATTGGGCGTTCGAGCGATATGATACGGATGAAATTGTTGGCACCG 

rACCAATAGATGATTCGGATrcfTCGATGGATGTAGATCCGGCAGGCAGCT 

rGGATAACCTTGTGGCTCGTTTAACATCAGtCATrGATTCTCATCGTAAtTAT 

rTGAGCGATGGAATGGTCATTGTTTTCTATCAACTTTGCACATTGTTCGTAC 

AAAACGCCTCCGAACATATTCACAATAATAACTGCAAGAAACAAGGTGGACG 

rcTACGGATCCTGATGCTCTTCGCCTGGCCGTGCCTGACCATGTACAATCA 

TrAAGATCCAACAATGGGGTACACTGGATTCTTCTTCTTGGCCAATATTATA 

rAGCGTTTCACAATTAATCGGAAAATCGTGCTTCAAGTGTTCCATCAACTTA 

TGACTACTTATCAGCAGGACACTAGAGATCAAATCCGGAAAGCCATTGATAT 

ATTAACTCCAGCTTTGAGGACACGAATGGAAGATGGACACTTGCAAATATTG 

AGTCATGTGAAGAAAATTCTTATCGAAGAATGCCATAATTTGCAACATGTTCA 

GCATGTTTTCCAAATGGTGGTTCGCAATTATCGTGTCTACTATCATGrrCGAT 

TGGAGCTTCTCACGCCTCTTCTGAACGGAGTTCAACGAGCACTTGTGATGC 

rAAATAGTGTTCTGGAAAAATTTAGCTGGCAAACTCGACGTCATGCGGTGG 

AGATCTGCGAGATGGTCATCAAGTGGGAATTGTTCAGAACGCTGAAAACAG 

ATCATATTATCAGTGACGAAGAAGCTCTCGAAGTTGACAAGCAATTGGATAA 

GCTGCGAACAGCTTCATCCACAGATCGTTTCGATTTCGAGGAGGCTCATAA 

rAAGAGAGACATGCCTGATGCTCAACGCACGATTATCAAAGAGCACGCCGA 

T6TGATTGTCAATATGCTTGTCCGATTCTGTATGACGTTCCATCAGAATTCG 

GGTTCTTCGTCCACTTCTCAAAGTGGGAACCATGGTGTCGAGTTGACCAAA 

AAATGTCAGCTGCTTCTACGTGCAGCCCTACGACCAAGCATGTG6GGAGAA 

TTTGTCAGCTTCCGATTAACAATGATCGAAAAGTTTrrGT-eAATTCCGAATGA 

TAATGCTCTACGCAATGATATAAGTTCTACGGCCTACGCTAATACTATCCAA 



CTCAATAACGGAGCTCA6AACTTTAAGATGACTCGTCTTGTCACTCAAATTG 

TCAGTCGGTTACTCGAAAAGACAAATGTTTCGGTTAACGGGCTTGATGAGCT 

GGAGCAATTGAATCAATACATTTCCCGATTCCTACATGAACATTTTGGAT.CTC 
TTTTGAATTGCAGAAACTTGAGTGGACCAGTGTTGGGAGTTCTCGGAGCATT

TTCTCTTTTGCGAACAATTTGTGGACACGAGCCAGCATACTTGGATCATTTG 

ATGCCTTCATTTGTAAAAGTGATGGAGAGAGCTGCAAAAGAGCACTTGGCG 

TATGTTGCGAACTCGCAAGATGGAAATATGGTGAAGAATTTCTTTCCAGATG 

TFGCTGAATTGTTGTGTGCATGCATGGAGCTGGTACGTCCCA6AGTCGATC 

ATATCAGTATGGAGATTAAGAGATCAATTGTTGGTGGTATTATCGCGGAGCT

GATTATCAAATCGAATCACGATAAGATCATCGAGACGTCAGTGAAGCTTCTC

GGAGCAATGATTAGCACGCAGGATATGGAATTTACAATTCTCACTGTTCTTC

CGCTACTTGTTCGTATCCAATCAATTATTGTGACCAAGTTCAAGAATTGCAA 

GGATCTGATAGCAGACTATCTTGTTGTGGTTATTACCGTTTTTGAGAACAGC

GAATATCGGAACTCGGAAGCTGGATCTCGTCTCTGGGAAGGATTCTTCTGG 

GGACTCAAGAGTAGCGATCCTCAAACCCGGGAGAAATTCTCGATAGTTTGG 

GAGAAGACTTGGCCACACATGGCAACAGTAGATATTGCTCATCGAATGAAAT

ATATCATGCAAAATCAAGATTGGTCCAAGTTCAAACACGCGTTTTGGTTGAA 

ATTCGCACTTTGGGGAATGCTACGAACGATTGCCAAACGGCCAACTGATCC 

GAATAATAAGAGAAAGAAAGTGATACTGTTGAACTGTGCAACTCCATGGAGA 

ACAATTGAATATGCAGCGAAATTGAAGGATCAGCCAATGGAAGTGGAAACT 

GAAATGAAACGAGAAGAGCCAGAACCGATGGAAGTTGACGAAAAAGACTCG 

CAAGATGATTCTAAGGATGCCGGAGAGCCCAAGGAGAAGGAAAAGCTCACA 

TTGGAATTATTGCTTGCTGGACAACAAGAACTTTTGGATGAAGCTTCCAATT

ATGATTTTGCGGATGCTCTAGATACAGTATGGCAGATTACATTTGCACTTAAT

GAGAATCAAGTGACAAGCAAGATGTGGGTAGTGTTGTTCAAATCATTCTGGA 

GTTCCTTATCACAATCCGAAATCGAAGATTTCACGGCGCTAGTCGTTCCGTT 

TATGAGCAGTGGAGTGCATAATAATTATCAGACGGGTGTACAGGATAGTGT 

GCTTGCTGTTTGGCTTGAAGCTGTTGGTGACGCTGTTCATTTGCCGTCCAG 

ATTGATTGAGTTTATCTCATCAAAACACGAATGCTGGCATACCGGAATCAGG 

CTTCTCGAGAATCATATATGGACAATTCCAAAGCAACTCAACAACACGTTAC 

TCCGAGAAATGAAAGTGGCACCAGGTCTCGCTGGAGATATTGAGACACTCG

AATCTCTTGGAACACTCTACAATGAGATATCAGAGTTTGATCAGTTCGCTGC 

AATCTGGGAACGCCGTGCTGTATTTCCTGATACGATGAGAGCAATGTCAGC 

TATGCAATTGGGAGATATGGAATTAGCTGAATCTTATCTGGAAAAATCAATG 

AGCAGTACGTATGAAACTCTTGCTCCGACAATCAATCCAAACAACACTTCAA

ATTCGGAGAAGCATGTTTCTCCGATTATTGACAAAGAATACGATCATTGGAT 

GGAGATGTACATCACAAATTGCTCGGAGGTTCTTCAGTGGCAAAATGTGGC 

CGACGTATGCAATGGCAAAGACATGCAACATGTTCGTGGCCTGATCAACGC 

AGCATCTCACATTCCGGACTGGAATGTGGTCGAGGAGTGTAAAAGTCAGAT 

AGCTGGATGTATTCCACCAAGTTTCCATTTAGATTACACTCTTTTCAATTTGA

TGAGTACTGTTATGCGAATGAATGAAAACTCAAGCCCGACACATATGAAGGA 

ACGATGCAAAATTGCAATTCAAGAGTGCACAGAAGCTCATATTAGTCGTTGG 

AGAGCACTTCCGTCAGTTGTTTCATATGGTCATGTCAAGATTCTTCAGGCAA

TGAACTTGGTTCGAGAAATTGAAGAGTCTACAGATATTCGCATTGCTCTGCT

CGA 



GTTGATGAAAGTATTCCGAAATAGAACACCAACCACTTCGGATGATATGGGA
TTCGTTTCGACTTGGTATGATTGGAGGAATCAGATTCATGGAATGATGCTTC 
AAAGATTCGAATATTGGGATAAAGTAGGACTCAACGTCGCTGCAACTGGAAA 
CCAGTCAATTGTTCCGATTCATTGAATGGCTCAAGCACAGTTGGCCGTAGCC 
AAACATGCCAAGAATCTTGGATTCCATAATTTAACGAAAGATCTACTCAACAA
ATTAGCTGGATTGACAGCCATACCGATGATGGATGCTCAAGATAAAGTTTGC
ACTTACGGCAAGACACTTCGCGATATGGCAAACAGTGCGGCTGACGAAAGA 
GTGAAAAATGAGCTATTGTGTGAAGCGCTTGAAGTTTTGGAAGATGTGCGAA
TTGATGATCTACAGAAGGATCAGGTTGCTGCATTGCTTTATCATCGTGCTAA
TATTCATTCAGTTCTTGATCAAGCTGAAAATGCTGACTACACCTTCTCCGCA 
GCCTCTCAACTTGTCGACTTGCAAAATAGTGTGACAACCACTGGAATCAAGC 
TCATGAAAAATTGGGGCCACCATCTTTACAAGAGATTCTTCTCTACGACAGT 
TTGCAAGGAAACCGGAAACAACTTCGGACGGCAGGCTCTCGCTTGTTACTT
CATTGCGGCTCGTGTGGATAACGATATCAAGGCGAGAAAACCGATTGCCAA 
GATTTTGTGGCTCTCGAAGCACTTGAATGCGTGTGGATCACATGAAGTGAT 
GAATCGGGTTATTAAGAAGCAACTTCATTCACTTAATCTCTTCAATTGGCTTT
ACTGGCTTCCACAATTGGTTACTGATGTTCGATATAAACCAAATTCGAACTTT
GTTCTGATTCTCTGCAAGATGGCTGCTGCTCATCGAGTTCAAGTATTTTACC
ACATTCGGGAGGCAGTTAGCGTTGACGATATTGACTCGGTTCTCGAAGAAG 
ATTACACTGATGAGCAAATGTCGATGGATGTTTCGGATGAGGATTGTTTTGC 
AGACGATCCACCATTTGATAGAATTCTGAAAATATGTCTGAAATATCGTCCAA
CTGATATTCGAGTCTTCCATCGTGTCCTCAAAGAACTTGACGAGATGAATGA 
GACATGGGTTGAACGTCACTTGCGTCATGCGATCTGCCTCAAGGATCAGAT 
GTTCAAAGATTTCTCGGAACAAATGGACGCGACGTTCAATGAGATGCAATAT 
TCGGAGGATGTGACTATGATGACGTTGAGATGGAGGAAACAGCTGGAAGAA 
GACTTGGTGTATTTCCAACAGAATTATAATCTTGATTTCCTGGAGATTCGTAA 
CAAGCGAAAGATGATCGTGACGAAGGGATGTATGGGAGTCGAGAAAAGTCA 
GATAATGTTCGAAAAAGAGCTGAGTCAAGTGTTCACAGAGCCGGCCGGCAT 
GCAAGATGAATTTGATTTTGTCACAAATATGACTAATATGATGGTCTCACAGT 
TGGATATTCATGCAGTCGATGCTCCACGCCCTCAGGGATATATTCGTATTGT
TCTCGACTGGATTCGAGCGATTCGTCGTCGTTTCGATCGACTTCCACGAAG 
AATCCCTCTGGAATCGTCAAGCCCATATCTCGCCAGATTCAGCCATCGTACA
GGATGCATCGAAATGCCATACGATTTGCTCAACGTTTTGCGCGCCAAGAAT
CATACTCTGATGGCTTCCAATCAAACGGGGCAATACATATCCATGCTCTCTC 
GATTTGAGCCAAACTTTGAGATTGTGATCAAAGGTGGTCAAGTGATAAGAAA 
GATCTATATTCGAGGACAAACCGGAAAGAGTGCGGCGTTTTATCTGAAGAA 
ATCTGTGCAGGATGAGCCAACTAACCGAGTTCCACAAATGTTCAAACATCTT 
GATCACGTTCTACAAACCGATAGAGAGTCGGCGAGAAGACATCTTCATGCT 
CCAACAGTGCTGCAGATGAGAGTCGGACAGAAGACGACACTCTACGAAGTT 
GCATCCGTTCAACCATATGCAATGCCACCGGATTGTACCAGAAACTATCCAG 
CATCACAAATCGACATTGTTCATCCATATGATGTGCTGACTGCCACTTTCAAT 
GGAAGTTATTATCCGGATGATATGGTATTGCACTTCTTTGAGAGATTCGCCC 
AAAGTTCTTCATCCATCGGACAACCTCTTCCAACTCCGACGAACCAAGATGG 
"AAC-AGTrGCXCCG CO ACG ACTAACG GAAGCTGACCACATC AAG AATATTATT 

TATGAAGACTTTGCCCGAGATATGATCCCATTCCGACTTCTCTACGACTACC

TCACTGCACGATATCCTGATCCGGTTATGTACTATGCAATGAAGAAGCAATT 

GCTGCACAGTCTCGCCGTCCTATCCACAATCGAATATCATTGCAATCTGACA

CCAATGGGACCTGATCAAATGATGATGACAATGAATACTGGAGTCCTTAGCA 

ATCCTTCATATAGATTCGAAATCCGAGGAGGACGATCACTTCATGATATTCA 

ACACTTTGGACATGAAGTTCCATTCCGATTGACTCCAAATCTATCGATTTTG

GTTGGTGTTGCACAGGATGGTGACTTGTTATGGAGTATGGCTGCTGCGTCA

AAATGTTTGATGAAGAAGGAACCTGAAGTTATCATGAGACCGTTAGTATGGG 

ATGAATTCGCCAACAATACAGATTGCGACAAATCGCGTTTGCAGGTATTCGC 

GTGTCATGCATCGAArrCTTACATCAATGGTGTCGCGAGCAAGCTTCGAAAC 

ACGAATAGCGCCGACGCCAAACTCAGAAAGGACGATTGTGTGTCGCTGATC 

AtBTCGAGCCAAGGATTCGGATAATCTGGCCCGAATGCCACCCACCTACCAC 

GCGTGGTTCTAG

TRR-1 protein sequence
MDPAMASPGYRSVOSDRSNHLTELETRIONLADNSQRDOVKLKMLQEIWSTIE
NHFTLSSHEKWERLILSFLQVFCNTSPQFIAENNTOQLRKLMLEllLRLSNVEAM
KHHSKEIIKQMMRLITVENEENANLAIKIVTDOGRSTGKMQYCGEVSQIMVSFKT 
MVIDLTASGRAGDMFNIKEHKAPPSTSSDEQVITEYLKTCYYQQTVLLNGTEGK 
PPLKYNMIPSAHQSTKVLLEVPYLVIFFYQHFKTAIQTEALDFMRLGLDFLNVRV 
PDEDKLKTNQIITDDFViBAOSRFLSFVNIMAKIPAFMDLIMQNGPLLVSGTMQML 
ERCPADLISVRREVLMALKYFTSGEMKSKFFPMLPRLIAEEWLGTGFTAIEHLR 
VFMYQMLADLLHHMRNSIDYEMITHVIFVFCRTLHDPNNSSQVQIMSARLLNSL 
AESLCKMDSHDTFQTRDLLIEILESHVAKLKTLAVYHMPILFQQYGTEIDYEYKSY 
ERDAEKPGMNIPKDTIRGVPKRRIRRLSIDSVEELEFLASEPSTSEDADESG6DP 
NKLPPPTKEGKKTSPEAILTAMSTMTPPPLAIVEARNLVKYIMHTCKFVTGQLRIA 
RPSQDMYHCSKERDLFERLLRYGVMCMDVFVLPTTRNQPQMHSSMRTKDEK 
DALESLANVFTTIDHAIFREIFEKYMDFLIERIYNRNYPLQLMVNTFLVRNEVPFF 
ASTMLSFLMSRMKLLEVSNDKTMLYVKLFKI1FSAIGANGSGLH6DKMLTSYLPE 
ILKOSTVLALTAREPLNYFLLLRALFRSIGGGAQDILYGKFLQLLPNLLQFLNKLT 
NLOSCQHRIOMRELFVELCLTVPVRLSSLLPYLPLLMDPLVCAMNGSPNIVTQG 
LRTLELCVDNLQPEYLLENMLPVRGALMOGLWRWSKAPDTSSMTAAFRILGK 
FGGANRKLLNQPOILQVATLGDTVOSYINMEFSRMGLDGNHSIHLPLSELMRW 
ADQMRYPADMILNPSPAMIPSTHMKKWCMELSKAVLLAGLGSSGSPITPSANL
FKIIKKLLEDFDPNNRTTEVYTCPRESDRELFVNALLAMAYGIWNKDGFRHVYS 
KFFIKVLRQFALIGVLEYIGGNGWMRHAEEEGVLPLCLDSSVMVDALIICLSETS 
SSFIIAGVMSLRHINETLSLTLPDIDOMSKVPMCKYLMEKVFKLCHGPAWYARS 
GGINAIGYMIESFPRKFVMDFVIDWDSIMEVILGTVEEISSGSADSAYDCLKKM 
MRVYFIKEEGQEEENLTLATIFVSAISKHYFHSNERVREFAIGLMDHCMVHSRLA 
PSLDKFYYRFKEFFEPELMRVLTTVPTMSLADAGGSLDGVQNYMFNCPDGFDF 
EKDMDMYKRYLSHLLDIAQTDTFTLNQRNAFKKCETCPSHFLPPFPITTHIDSMR 
ASALOCLVIAYbRMKKQYlDKGIELGDEHKMIEILALRSSKITVDQVYESDESWR 
RLMTVLLRAVtDREtPElAEKLHPSLLKVSPi^ 

DRHISYNDIMKFKCLVELNPKILVTKMAVNLANOMVKYKMSDKISRILSVPSSFT 

EEELDDFEAEKMKGIRELDMIGHTVKMLAGCPVTTFTEQIIVDISRFAAHFEYAY 

SQDVLVNWIDDVTVILNKSPKDVWKFFLSRESILDPARRSFIRRIIVYQSSGPLRQ 

EFMDTPEYFEKLIDLDDEENKDEDERKIWDRDMFAFSIVDRISKSCPEWLISPNS 

PIPRIKKLFSETEFNERYWRALTEVKKFOEEIIVKRMTEHKYKVPKLILNTFLRYL 

RLNIYDYDLFIVIASCFNGNFVTDLSFLREYLETEVIPKVPLQWRRELFLRIMQKF 

DTDPQTAGTSMQHVKALQYLVIPTLHWAFERYDTDEIVGTAPIDDSDSSMDVDP 

AGSSDNLVARLTSVIDSHRNYLSDGMVIVFYOLCTLFVQNASEHIHNNNCKKQG 

GRLRILMLFAWPCLTMYNHODPTMRYTGFFFLANIIERFTINRKIVLQVFHOLMT 

TYQQDTRDQIRKAIDILTPALRTRMEDGHLQILSHVKKILIEECHNLQHVQHVFQ 

MWRNYRVYYHVRLELLTPLLNGVORALVMPNSVLEKFSWQTRRHAVEICEMV 

IKWELFRTLKTDHIISDEEALEVDKQLDKLRTASSTDRFDFEEAHNKRDMPDAQ 

RTIIKEHADVIVNMLVRFCMTFHONSGSSSTSOSGNHGVELTKKCQLLLRAALR 

PSMWGEFVSFRLTMIEKFLSIPNDNALRNDISSTAYANTIQNAQHTLDMLCNIIPV 

MPKTSLMTMMROLQRPLIOCLNNGAONFKMTRLVTQIVSRLLEKTNVSVNGLD 

ELEOLNQYISRFLHEHFGSLLNCRNLSGPVLGVLGAFSLLRTICGHEPAYLDHL 

MPSFVKVMERAAKEHLAYVANSODGNMVKNFFPDVAELLCACMELVRPRVDHI

nSQDDSKDAGEPKEKEKLTLELLLAGQQELLDEASNYDFADALDTVSQITFALN

fnS5?sSvwlfksfwsslsoseiedftal^ 

^APGLAGDIETLESLGTLYNEISEFDQFAAIWERF^VFPDTMRAMSA^^^^ 

MELAQSYLEKSMSSTYETLAPTINPNNTSNSEKHVSPIIDKEYDHWMEM 

qFLLQWONVADVCNGKDMQHVRGLINAASHIPDWNWEECKSQIAGCIPPSFH 

fnrrLFNLMSTVMRMNENSSPTHMKERCKIAIOECTEAHISRWRALPSW^ 

LvKLQAMNLVREIEESTDIRIALLEAPSNKVDQALMGDMKSLMKVFRNRTPTTS 

nnMGFVSTWYDWRNOlHGMMLQRFEYWDKVGLNVAATGNQSIVPlHSMAQA 

OLAVAKHAKNLGFHNLTKDLLNKLAGLTAIPMMDAQDKVCTYGKTLRDMANSA 

?DERVKNELLCEALEVLEDVRIDDLQKDQVAALLYHRAN1HSVLDQAENADYTF 

qAASQLVDLQNSVTTTGIKLMKNWGHHLYKRFFSTTVCKETGNNFGRQALACY 

fDXrvdndikarkpiakilwlskhlnacgshevmnrvikkqlhslnlfnwlywl 
msmdvsdedcfaddppfdrilkicl 

rhaiclkdqmfkdfseomdatfnemqysedvtmmtlrwrkqleedlvyfqqn 

ynldfleirnkrkmivtkgcmgveksqimfekelsqvftepagmqdefdfvtn 

mtnmmvsqldihavdaprpqgyirivldwirairrrfdrlprriplessspylar 

fshrtgciempydllnvlraknhtlmasnqtgqyismlsrfepnfeiv 

rkiyirgqtgksaafylkksvodeptnrvpomfkhldhvlqtd 

pmOMRVGQKTTLYEVASVQPYAMPPDCTRNYPASOIDIVHPYDVa^ 

cyypddmvlhfferfaossssigqplptptnqdgtvapprlteahhikni^ 
ardmipfrllydyltarypdpvmyyamkkqllhslavlstieyhcnltpmgpdq 

dllwsmaaaskclmkkepevimrplnaa/defan 
ngvasklrntnsadaklrkddcvslisrakdsdnlarmpptyhawf

Ar-nTATRCTCAGMTCTATGCCTGCTTGGCAAACTTTTTCTGGATCACAAGA

cIa^gggtc^^^^ ■ • 

ASMGmGCTCATCGAATTCAGCTATG/^^^^^^ 

MGACAGGATCACCCGAAAAACCACTATCAGATIT^ 
rATCGTACTGGTCAATGGCCATCATGAAAGAGCTTTTCGCATTCAAAAGAC^ 

^rlTrCAGGCGMGATATCACAGTTCAGGACAm 
A^ACGAGMGATG?^^^^^ 

^ScMSTMTTGTGATOGTGATGAAAAGCGTCAAGm 

r^CGGATTGAGGCTGCGAAAAAGAAGAC 
G^?GGCGACCGAAAGAGTACGGAAAGAAAAG^^^ 
A^TTCGTGTTTAeGGCTAAAAACTGAAAA 
^CATTI^TTTTTTCAAAAAACCAAAAAAAAACAAT^ 

aIaaTGGAAATACTGGTTTACAGTGTCAAAAAATTGAAATTTTCTAATAAAAT 
AAAATGGA^ 

??^fcAGMA^ 

irAS^TAG/^TATTTTCACAAAACATACTATTC^^ 
?r?AAAAACCGGCA^CAA^ 

?SScATCTA^AC^^^^ 
TG^TGCAGT^X^ 

r^I^GTGMGmTCTAATCATTAAGCGTTCTGAAGATATTCGGC^^ 
rC^GAGCGATCAGATCACGGCGGGAACGAGTTGAGGCGTAGACATGCTTG 
rScCAGTGACMCCTGAAAGATATTCAAAAA^ 



I^PrMGCGMAGCGCGCTCCAATGTAAAACGAAA^^ 
ACGTOGGTCCCGTTAGGAATTTGTTATTTTTTCGGTTAmCTGACTA^^^^ 

aScg'/J^cgacaagtat^^ 

AAAACAACAAAAAACAATCGAAAAAATAGTGAAAAAGTTTGAATTTACAGTCT 

Scttcttctgttcatcctcaatctcgtcaatgtattgattggac^^ 

aCAATCTCGGACTGTTCCGCAmTCATCCTTCAA^^^

TGMTTGAGCCACCTTCTCCTCTCCGAMGCCTTMCC^^^

Igcttctttcaagttgccctcggccttctccttggcatctc 



^rTTATTGGTCATGATGGCTTCACMGMGMGAMGATGG^

A^fficBATGCCGAGCTGCAAATGGnCAATT/^TTC^^^^^^ 

rrA^n-GCAACCGAAGACTTGACGAATGGGTTCAGTCTGATAGGCTCAATTTA 

rrrTCGTGTGAGCTACCAAAAAAAGGAGGAAAGAAAGGAGCACACTTGCG 

rRAAGAAAATCGAGATTCGAATGAAAATGAAGGAAAGAAAAGCGGCCGAAA 

AdGAAAGATTCCACTACTTCCGATGGATGATCTCAAGGCGGAATCCGTAGA- 

?CCXrTM:AAGCAATTTCA^^ 

rrTTCGATGTCGATGGTCGGCCATAGTGAAGATGCAATGACAAGGATCCGA 

AATGTCGAATGCATTGAACTAGGAAGATCACGAATTCAGCCATGGTACTTTG 

rACCTTATCCACAACAATTGACAAGTTTGGATTGTATTTATATTTGCGAATTT 

TrTCTGAAATATCTAAAGTCGAAAACTTGTCTGAAACGGCACATGGAAAAAT 

rTGCAATGTGTCACCCACCTGGCAATCAAATCTACAGTCACGATAAACTTTC 

GI^^C^TCGACGGCCGCAAAAACAAAAGCTATGCTCAGAATCTATGC 

rTGCTTGCCAAACTTTTTCTGGATCACAAGACTCTTTACTATGACACGGATC 

rATTTTTGTrCTATGTGCTAACCGAAGAAGACGAGAAGGGTCATCATATAGT 

TrrATACTTTTCAAAAGAAAAAGAATCAGCTGAAGAATATAATGTTGCGTGT 

ATT^nGTGTTACCTCCATnCAAAAGAAAGGATACGGAAGTTTGCTCATCG 



^CACTATCAGATTTGGGACTTCTCTCATATCGATCGTACTGGTCAATGGCC 
ATCATGAAAGAGCTTTTCGCATTCAAAAGACGACATCCAGGCGAAGATATCA 
rAGTTCAGGACATTTCACAAAGTACATCGATTAAACGAGAAGATGTTGTGTC 
AACGTTACAGCAACTTGATCTATACAAATACTATAAGGGATCATACATAATTG 
TrAtTAGTGATGAAAAGCGTCAAGTTTATGAGAAACGGATTGAGGCTGCGA, 

1i2agaagacacgaattaatccagcagctctgcaatggcgacccaaag 

TCGGAAAGAAAAGAGCGCAGATCATGTTCTAG 



Title: RB PATHWAY AND CHROMATIN REMODELING
GENES THAT ANTAGONIZE LET-60 RAS SIGNALING
Applicant(s): Horvitz et al.
Filing Date: September 12, 2003 Serial No.: N/A
Page 29 of 91 Customer No.: 21559



FIGURE 14



HAT-1 protein 

MTEPKKEIIEDENH(plSKKIPTDPRQYEKVTEGCRLLVMMASQEEERWAEVISR

CRAANGSIKFYVHYIDCNRRLDEWVQSDRLNLASCELPKKGGKKGAHLREENR

DSNENEGKKSGRKRKIPLLPMDDLKAESVDPLQAISTMTS6STPSLRGSMSMV 

GHSEDAMTRIRNVECIELGRSRIQPWYFAPYPQQLTSLDCIYICEFCLKYLKSKT 

CLKRHMEKCAMCHPPGNQIYSHDKLSFFEIDGRKNKSYAQNLCLLAKLFLDHKT

LYYDTDPFLFYVLTEEDEKGHHIVGYFSKEKESAEEYNVACILVLPPFQKKGYGS 

LLIEFSYELSKIEQKTGSPEKPLSDLGLLSYRSYWSMAIMKELFAFKRRHPGEDI 

TVQDISQSTSIKREDVVSTLQQLDLYKYYKGSYIIVISDEKRQVYEKRIEAAKKKT

RINPAALQWRPKEYGKKRAQIMF

[Figure: gene structure diagrams, panels A and B. Labels: epc-1, n4076 (scale bar 0.5 kb), n4077 (scale bar 1.0 kb).]

epc-1 genomic sequence

TTTCAAAAAAAAAAAATTACCTCGTCAATTT CACTCT CGTCGATGCGATGATT 

ATCCTCGTCCATmACCTGAAAAGTGTGATTTTTTCACGAATAAAATTATTTT 

CAGATACTTCTAGAAAAAAAAAACTGAACGGAATGnACGAAATTAATTTTCA 

A^GTTGCGAAACTGAATTTTCGAeAAAAAGTTTCACTGATATTCArrTCAAGC 

AtATTGCAACGTTTTTAAAnAATTTCTAAGAGAAAAAACTGCAAAACAATTC 

GAMATAATTTTTACAAGnACTTTTCGAAAAAGTAAC AAAAA TCCACTAATG 

AACAAGAAATTTTTGAACAAAAAGAGCTTCTCAGGCTATTTTrGGACGAATAT 

TTTAATAAAACTTTAAAAAAATCAACGAAAATCCCCTAAAAATCGCTGAAAAT 

TCCAAAAATTAAAGTTCATTCTCGACCACACGTCTCGTAAATCAGCACGAGA 

CTCACGCAACGCGACCGCGCCGCACTCAACGGCATTGAGTAATGCGGAGC 

GGCAGCGTCGCGTCGTCTATTTGTGTGtGTGTGCGATTGTGTGTGGTGCGA 

CGTGGCCGCTCTGTGTGCCTCTCTAGTGAGTGTTTTCCGAGGAGAGACAAC 

ACATTTTCGAGAGACGAAGAGAGTGGCGACGAGGAAGATAGTGTGGTAAGA 

GGAGAGTGTGCGCGAGGGAAAGAGAGCAAAGTGTGAGTGTCTGTGAGAAG 

AGAAGGAGACCCCCCCCCCCCCGCGCTCAACCAGTCGATAGTTGGCCTGA 

GTGTAGGGCCTTCTGTTGTATTCCACTGCTAACCCCCCCCAAACACACAAAA 

AGACTCAAAAAGTACTGCTTAAAACACAGTGCTCAGCTCATTTCATTTTTGAT 

TTTTATGCTCGCCGTCATCGGCGGATGAATTCATCGCAAAGTCCGTGGCGA 

TTCAACACGTGCGGCGTCCTCGCCGCTCTTCTTAACCGTAGTTACAACGTG 

GGAGTACAGAAAGgpSCCACTACTTCGAAGGCGTTTCGAGCCCGGGCGC 

TCGACTCGAACCGGTCTATGACTGTATACTGGGGCCACGAACTTCCGGACC

TATCAGAATGCAGTGTTGGAAACCGGGCGGTGACACAAATGCCGTCTGGCA

TGGAAAAAGAAGAAGAACAGGTTGGTTTTTGGTG6ATTATGGATTACTGCTC 

CAnTTGAAATTTTTCGAGTTTTAATGTCTTTTrTCGAATTCCTGGTGCrrTTT 

TCTATCCGAATCATGTTTTAATTCCGTTTTCCGACTACTTTGAAGAATTTTCA 

MTTTTTGATCCCTGATGACGTCACTATTTTTGTCTTTGCCTTTCTGGATCGC 
TmATAGTOTTTTCATTTTTTATTTCTTTTTTACACT^ 

TCnA ATTC ATCCTATTCTATTTAATTTTAAGTTTTGATTTTTGATTTTTGATTT 

TTCTCTTTTCTCTTTTAGCCGCCGGTGGGCCTTTATTACAACTCTTAAATCAT 

AAAAAAAATCAGTTTAAGCAGTTATACATAACTCTTATTATGAAAAAATCGTTA 

TTnTCGACGGAAACTTCATACTTTGAATTTATTTCCAATTTAGATTTTATTrT 

CTCAAAGTCAGCTCAATTAACTAACTTAAAATGTTTTGTCCTACCCGCAAAAT 

GmTTTmAATATTTTAATTCTATTTTAATTTTTGGCTTTAAAAAATCATTTT 

GCTAAGCCTGAGATGAAGGCGAAATCTCGAGAAAAAGCATTTAAAAAGTAAT 

AAATTCCGTTAAAAACGACTTTTTCTATCACAGAAAGTGTTCTCTGAGTGCTA 

ACAACCTTCTTCTGTCCAAATTTTGACACAATTTCCCAATTATGCCGACTTAT 

TACACCTTTTTCCGTCAATCTTCTAGTTTTTCCCACCCTCTTGACCCCTGGTG 

ACGTCAmGmGTTCTTCTTCCAAGACATGCCCTGTGGGGTATTTTTTCTC 

AAAATTTTTGCAAATTTATTGGATTCTAAATAAAATTCCAGGAGTCTAGCACC 

AGGAATAATAATGCAAATTTGAAAAAAAAATTAAACAGAAATAATGATTTTAA 

ATGATTATTTAAATTTTAAATTTTAAATTTCCAGGAAAAACACCTGCAAGAAG 

CGATTGCTGCCCAGCAAGCCAGTACATCGGGTATTCAGCTGAACCATGTCA

TTCCAACTCCAAAAGTCGACCGAGTCGAAGATCAACGCTATCACTCCACTTA 

TCACAACAAGAATAAAATGCACCGTTCAAAGTATATGAAAGTTCATGGTGAG 

TTTTTTTAACCAAAATTTCGGCGAAAATAATTTAATTTCGGGTTTTrrGAAATT 
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MTTTCCGCTT<5GGTTTrCTTGTATTrA™TTTTTTGAMTTCCTCTCTGAAT 

TCGAMGAAMTMCTTGArrrTTCAGACTTCCtGGCTAAAACCTTCAAAAAT 

GTTTGTTGATTGGnCCAAATlTTCGCCTGATTCCGAATTTeGAtGTGACAAA 

TTCAAAAAAAAATTCCCTGATmATATTCAAGCTT TGTGm GTGTGT TCTTT 

TTGGAGCGCGCTTGCATCGTrTGAT TTTCTT CGTCl I 1 1 1 I AAAATTTATTTTC 

GCTTGTTTCAnCATTmGTCGAGTTTTTTTCTGCCAAAATGAATGAAACTG 

GTtTAAAAAAnGAATTCGGCGAAAA TAAA TTTTGAAAAACGAAACAAATCAA 

ACGATGCAAGCGCGCTCCAATGCGATTTTTTTGGGC GCGGAA ATTCGTGAT 

TTCAAGCTTAAATATAAAATCAGGTATATTTTTTCGACI 1 1 1 I iCACGTrGAAA 

TTCGGAATCAGAGGAAAATTTTGAGTCAATCAAAAATATTTCCCAGATTTCG 

GTATCTTTAAT6CATCAAAAATGAACTTTCACCCCC ATACTCCCAGAAA AATA 

AGAAAACAAATTGCGAAATATTGTTCCCTGATCAAAl I I 1 1 I CTTTTTTTAACT . 

ACACTTCTCTGTTTTGAAGTGAGAAAGTACATTTTTCTGCGTTTCTTATCAGT 

TATCATTTGAAAAGGATCAGAATTTGATGACGATATATnGTTTA GTTAC CTC 

CCTTTTTTCTGAACAGTTTTTGCGAAAAAAGGAGAAAAACCGGAATTTTCTAT 

GAAAATGTGATTTATTTTCAGCCTGGCAAGCACTCGAACGAGACGAACCCG 

AGTATGACTACGACACAGAAGATGAAGCATGGCTATCAGATCACACTCACAT 

TGACCCGCGCGTTTTGGAAAAGATATTCGACACAGTGGAGAGCCATTCATC 

GGA6ACACAGATCGCGAGCGAAGATTCGGTGATTAATTTGCATAAATGTAA 

GTrGACGAAATTTCCATTGAAACCCCCCCCCCCCAAAAATATCGTTTAATTG 

CAGCACTGGACTCATCAATCGTGTACGAAATATACGAATATTGGCTGTCGAA 

GCGAACATCGGCTGCGACGACGTCTGGTTGTGTTGGAGTCGG TGGA TTAAT 

TCCGAGAGTCAGGACAGAATGTCGGAAGGTAAGA ATTTGA CTATTTTGAAC 

GAATTTCGTGATGAAACTTCTCTAAAACTTTTAAAGi mi l ATGG CGGTTCA 

aaatttcggaaaamacactgattttagctaaaaacttgaattttggtcattt 
gtccgtgtcacatctgtccgaaatcgactttttttggaattatcatcctttat 
tgcacatttggctagtttatctcatttaatttcgtt gatta ctaag gtaca ttt 

AAAGCCAATAGGTAACCAACCAAAAACTATCATAAi I 1 1 I CTA CACT TTTTAA 

ttnccgacactacttgaataaccccataagtgaccaattttgatagtttttg 
gctggttaccggctttaaatgtaccttattaatcaacaaaattaaat gagat a 
aactagccaaatgtgcaataaaggatgataattccat aaaa agtcgattttg 
gacagatgtgacacgggcaaatgaccaaaattcaagi i \ i la gctaaaatca 

GTGTATTTGTTTCGAAGTTTTGAACCGCTATAAAAAAAi I I I IGGAATGCTTT 

tggcaagtttcattacgaaattcactcattttctatacgcaaaaattagaatt . 

ttcaattaaaaattcattttacaggatggacaaggtgttatcaatccgtacgt 

tgcattccgtcgacgtgccgagaaaatgcagactcgaaagaatcggaaaaa 

cgatgaagattcgtatgagaagattctcaagttggtacatgacatgtcgaaa 

gctcaacagctcttcgatatgactgcccgacgagaaaagcagaagctcgcg 

ttgattgatatggaatcggagattttagcgaaacgaatggagatgtcagatt 

ttggtggttctccgagttcgttcaatgagatcaccgaaaagattcgagcag 

cagcaacgttggaagtcgtgaaaccaccactggcagaaatcaacggatcag 

atgaagtgaagaagaggaagaagccgagacgaaagattgctgataaggatt 

tAAtATCGAAAGCCTGGCTTAAAAAGAATGCAGAAAGWGG'AATe 

cgtcgctctttggacaacacagtggaaatgttccgacggttacaacgaagc 
cagttcgagagtcgttggcgaatgggcgatttgcgttcaagcggaggaga 
ggatgtgtttatcgcgcggctctcaccgtttacaatgtgcctacagcgcct
GCTACAGTACCTCCAGTACAGACTCAAGCAGCAGTGGCTTCATCATCATGG
TCAAAATCAACGGATATGGTGCCGTGGAACATGAAGTTCTTTGAAACTTTTG 
TTCGGGATTCACAGGATTCAGTTTCTCGATCTCrrGGGTTTGTACGCeGACG 
AATGGGACGAGGTGGGCGAGTTGTATTC6ATCGGATGCCTCGCAATCGAG 
ACGACAACGACGAACGCACTTCGACAGATCCATGGGCCGAGTATTGTGTCG 
CG GATAGn CAAGGTGAGATTTnGAATAAGAATCTTAATTTCACGAGATTTT 
GGTnTTTT CGCTGCTTTTTCTGTAAmTGTGGTATTTTTTCTCGTATTTTCA 
ATTAAAAAACGGGTmAAATAATTTTAACCTGAAATTTCGCTAAAAACCAAG 
AAATTTCATTAAAAAATGCAACAAAAAAAAAGACTGGAGGCACCACCGAAT6 
GAGAACAGGAGAACCCAAAACCACGCCCATTTTTCCGTGCCGGGCGGCGA 
AAATTTTTGCAGAAmGCTGCAATTrTrCGTTTTACAAACGAAACAACGAAG 
CTCTGAATGTGTTATTTCGGAGCTTCGTTGTTTCGTTTGTAAAACGAAAAATT 
GCAGCAATTTCTGCAAAAATTTGCGCGCGGCACGGAAAAATGGGCGTAGTT 
TTAGGTTCTCCTGTTCTCCTTTCGGTGGTGCCTCGAGTCTTTTTCGCATTCTT
AATGAAAmCTnGmTTTAGCGAAATTTCAGGTTAAAATTATTTAAAACCC 
GTTTI 1 1 I 1 1 CAATTGGAAATGCGAGGAAAAACCACAAAATCACAGAGAAAG 
CTTTTGGATTTmCGCAGCTTTTTCTGTGATmGTGGTTmcCTCGCATTT 
TCAATTGAAAAAAAAACGGGTTTTAAATAATTTTCACCTGAAATTTCGCTAAA 
AACGAGGAAATTTCATTACAAATGCAAAAAAGACT6GAGGCACCACCGAAA 
CCGAATGCAGCTCAGAACAGGATTTACCAAAACAGGATGCAGTAGGCGGAG 
CCAATTCGCAACCACCGCATGCTTATTTCGCATGCCTCGCACG' I I I I 1 1 I H | 

CTCTTGAAACAATGCAACAATATCAAGGAAAAAACGTGCGAGACTTGCGAAA 

TAAGCATGCGGTGGTTGCGAATTGGCTCCGCCCACTGCATTCTGT7TTGGT 

AAAT TCTG TTCTGAGCTGCATtCTGTTTTGTTGGGGCrrCCAGTCTTTTTTGT 

GCAI I I I lAATGGAATTTCTTCGTTTTTAGCGAAATTTCAGGTTAAAATTATTT 

AAAACCCGTTTTTTTTTCAATTGGAAATGCGAGGAAAAACCACAAAATCACA 

GAGATAGCGAGGCCCCACGAAAAGGGGAGCAGAACAAAAAAGGGGGGGG 

GGGGGCTGGCACTGTGCCAAACGCACAAAACGCTTTTTATTCTTATTCAACG 

CACGACTTTGTTATAACCACACTCCGTTATTACGCATCGCGCGCTGTTTAGC 

GTGAAAATACAAAAAAACGTCGTGCGTTGAATGAGAATAAAAAAGCGTTTTG 

TGCGTtTGGCACAGTGCCAGCTCTCCTTTTCGCAGATCCCCTTTTCGTGGG 

GCCTCAGAGAAAGCTGCCATAAACTTTTTTCTTCGCGCTAAGACCAATACCA 

ATAAATCCnGCGCCTTTAATATGCAAACTATATTTTTCTTCCAGAACGTTCC 

GTGCTCGAAACAGTTCGCTTGGTACCGAAGAAGAAACCGATGATCTAAGCC

CGAAATCTCTGTATTTCGCTCGCAGTAATCGGTTCGCATTCAACGATGATGA 

AACTGAACGGGAATGGACTTCAAGATGCCAACAATCATCGTGGAGAGATAC 

AGAGGTGGATGATGAGCTGAAAAAGCGGGAAACAACGTCTGAAAGTGAGAT 

TTTGAACGATTTACCTGGGAAAATAGATTATTTTGGGCCTATTTTAATTATTTA 

ATTGCAGAATTTACCGAAACCACGACGAATGGAAGTACCAAAACACACACA 

GAATCGGATGATAGTGAAGTTGAACGGATGGAGGTTGATGATCAAGTTGAT 

GAAGCTCAAATAAGTGTATCATCATCAAAAGACGATGGAATGAATGGAAATG 

ATAAGAACGAGGATGAAGAAGATGATGATGATGATATGGATGTAGATGAACA 

TCAGACTGTCGTGGGTGTGCATCAGCACCAGCAGCAGCAGCATCACCAGC

AAAAAGTTGGGGATCAAATGAATGGTGGTGGTGGTGGTGGTGGAGTGGTAA 

AACTGAAACCGCCGCTGCAAGAACTTTGGCCGCCGCTTTCGGGAAACGGAA 

GAGCGGACAGAGCGGAACCGACGCCGGttCCGGCAAAGGTAGTGAGGCTT 




FIGURE 16 



TTTTTnAAATACTCGAAAAAGAAGGAAAAAATCCCACTTTTAAAAATACGAT 

Tr.TTAAAAATGC6AATTCC.CTCCAAAAT6AGAACTCTGATT 6GCCAGGGA GC 

TCTCATTTTGAATGGAAATTAGTCAAAATTGAAAAATCCCGi \\\\\\\ i lAAG 

TTGGAmTTCATTTTCTCGCGATTTTTTCCGCGTTTCTGTGTCATTCCTGAA 

TTTAACATTTAATAAATTAAAAATGTCTGGAATAT TGACAAATTATGCTTCA AA 

TTTTTTGCGCGGGAGnCAAAAATAATTTGGCCCTT TTTAl 11 11 I ATTTTG CA 

/i^^TATATAAAAAATCATTTTAAAAAATTTAGAAACAl I 1 1 1 1 AAi m 1 1 1 AA 

CAGTTATATTCGCTATATTGGGACGGTATTCTGTCATTAAACTTGGTGT TGTC 

GAATTmTTTAnGCTTTATAAGACTCAAAATTGTCTGAAAACACCGAATTTT 

ATAATGAAACTTCTTGGAAACTTCTCAAAAAAAAGTTATGACGGCTCAAAAAA 

TGACCTAAAATTTGTTAAAATTTGAAATTTGACTTGTCGCAACGGCTG GAAA C 

/^TTmTTTTTTGAAATCACCGTCAAATTTTGAGTATAAAATTTAATTATTTTG 

CGTTTTCAACTCGATTTTTGGTATTTTCAAGTCGATGGACGGCAAGATTTGG 

TTAAAAAATTAAAAGCCGTCCATTTTCTCGCCGTCCATTGACTTTAAACTACC 

TAAATCGAGTTGAAAACGCAA6ATAATTGACATTTATACCCAAAATTTGACTG 

Tf^GTTTTAAAAAAGTTAGTTTCCAGCCGCTGCGA CAAGTCAA ATTTC CAATTT 

TAACTATTTTAGGCCATTTTTT GAGCCATCATAACTT TTTTT TGA6AAGTTTTT 

AAGAAGTTTCATCATGAAATTCGGTGTTTTCAGACAATTT TGAG TCTAATAAA 

GTAATTTTAAAAAATTCGACAGACACCACCTTTATAGCAATTnG AATTT TTTT 

TTAAACTTGTCTTGAAAAATCTTGAAAAAAGTCGAATAAATTCCCATTTTCCT 

ATTTTCTTnTFGCAGATGTGCGGAAGGGTGTGGGACTCAGATGATTGGAGA 

GAGCCGAGTGGATCACCATCAGAATCGAATTGATCAACCGAATGGGGTGGC 

TATACGCCACAAGAACAGCATGCAGTTGTTGTTGCCAACGCGGTAGCTGTC 

GCTTTCAAGGAAAAATTGATGAATGGCGTGGATGATGATGATGATCAACAAC 

CATCGCCGGCTAGAGGAGCACGAGATCATTCCATCAAAGAGTTC GTTA GTT 

TTTCTTTGCTTTTTTTTTTTTTGATTTTTGAGAGCAAATTTGAAAAGrrrTACA 

r.GGTTnTGAAAAACTGTTGAAATTAAAATTTGTTGAGAATT TGATT TCGAGC 

AAGTTTTATTTTTAAAAAATTGAATTTOCAGAAAATTCTdAGTtTTCTTTTTAA 

AAAATTGAAATTTTCAGAAAATTCTGAGTAGCAAGAATCTTTA AGATCCTTA A 

TTTCTATGCAAGAATACGTAGGAGTTTTACTTTGCTCAGGAAATTTTATTTTTT 

GTCAGAGGAGTATATCCGAAAAAGAACAAAAAAAATGCACATTTCTCAAAAC 

GCGTATTTTTTTTTCAGTTCGATGTCAACGGTAACACTGCTGGAACGGAAAA 

AGTTCATGATGCCGTCGACAATCGGTCTA^^n TGAA CTCTCTGCTGCTGC 

TTCTGCTACTGCTGCTACTGCTGCTCATCGCCAATTTTCAATCCTCCTGAGA 

TTTTTTGATGGTCATTCATTGTTTTGTGCATATCTCTCTCTCTCTCTCTCTCTC 

CCATGATTCTCAAATATTTCAATGTATTTACACCCCCACTCTGTCCGCTGCCT 

AATCCCCGACCGAATAATCAGATTCGCTGGAAAAATCTGCGATTCTTTAATA 

TTGCAACCACCCACCCAATAATATGTGTCTCATCATCTCGGT ACTCT CACTT 

GAGCCGTGTTTTCTGTAGTATTTTATTCTCTAAAAAAAAATCATTTTTAATATA 

ATATAC'GTACA-CATTTATATCTGT-AATATATATTTTTAA AAAT GATTCCCCCCT 

CCCCTCCATTCGTTGTTTTTTTTCTGTGGGTTTCAAGCTTTTGAGCTGTGAAA 

AATCTCATCCCATCATCATTTTCTATTGTTTTTTTTCACAGTTGAAATATCCTA 

tTTTATCTTmCCTTTTTT-m-eATTTT^^ 

TTTCGTCCCGCGAAACGCCCGCCGCCGCCCAATCCCACTCTCTCTCTCAGT 
CTCTTCTTAATGATCTTCGAAACTATTTTTATTTCCCTCATTAACAATTACGAG 
GTCGTCTTTTTTTTTCCCCACCCCCCACTGTTTGGTGTAATTTTTGTGTTCGG
GGAGGTTTTTTGTGTGTGGAnTTTGGATTTTTTGGAl I I 11 ICAACAAAAAA
TTCCCCCGAAATCAAAAmTTTCCCATTTTCCCCTCAATATTAGTA CTGTTG 
TAtAAATAAACTTGCTCTCTCTCTCTCTCTCGAAATCTCCTACTArrATTTTTT 
T AAAAGATTTTTCCAACAAAAATTCAAAAAACCACACAAACGAC CTCTCTGC A 
CeCGGTAATCCTCTCTCTTTTTGTCCCCCATTnCTCTGmCTCTTTTTTtCT 
AtCCCCTATACCTGTGATTGGAATATG

ATGGCCACTACTTCGAAGGCGTTTCGACCCCGGGCGCTCGACTCGAACCG

GTCTATGACTGTATACTGGGGCCACGAACTTCCGGACCTATCAGAATGCAG 

TGTTGGAAACCGGGCGGTGACACAAATGCCGTCTGGCATGGAAAAAGAAGA 

AGAACAGGAAAAACACCTGCAAGAAGCGATTGCTGCCCA6CAAGCCA6TAC" 

ATCGGGTATTCAGCTGAACCATGTCATTCCAACTCCAAAAGTCGACCGAGTC 

GAAGATCAACGCTATCACTCCACTTATCACAACAAGAATAAAATGCACCGTT 

CAAAGTATATCAAAGTTCATGCCTGGCAAGCACTCGAACGAGACGAACCCG 

AGTATGACTACGACACAGAAGATGAAGCATGGCTATCAGATCACACTCACAT 

tgacccgcgcgttttggaaaagatattcgacacagtggagagccattcatc 

ggagacacagatcgcgagcgaagattcggtgattaatttgcataaatcact 

ggactcatcaatcgtgtacgaaatatacgaatattggctgtcgaagcgaaca . 

tcggctgcgacgacgtctggttgtgttggagtcggtggattaattccgaga 

gtcaggacagaatgtcggaaggatggacaaggtgttatcaatccgtacgtt 

gcattccgtcgacgtgccgagaaaatgcagactcgaaagaatcggaaaaac 

gatgaagattcgtatgagaagattctcaagttggtacatgacatgtcgaaag 

ctcaacagctcttcgatatgactgcecgacgagaaaagcagaagctcgcgt 

tgattgatatggaatcggagattttagcgaaacgaatggagatgtcagattt 

tggtggttctccgagttcgttcaatgagatcaccgaaaagattcgagcagc 

agcaacgttggaagtcgtgaaaccaccactggcagaaatcaacggatcaga 

tgaagtgaagaagaggaagaagccgagacgaaagarrgctgataaggattt 

aatatcgaaagcctggcttaaaaagaatgcagaaagttggaatcggccgcc 

gtcgctctttggacaacacagtggaaatgttccgacggttacaacgaaggg 

agttcgagagtcgttggcgaatgggcgatttgcgttcaagcggaggagag 

gatgtgtttatcgcgcggctctcaccgtttacaatgtgcctacagcgcctg 

ctacagtacctccagtacagactcaagcagcagtggcttcatcatcatcgtc 

aaaatgaacggatatggtggcgtcgaacatgaagttctttgaaagttttgtt 

cgggattcacaggattcagtttctcgatctcttggctttgtacgccgacgaa 

tgggacgaggtgggcgagttgtattcgatcggatgcctcgcaatcgagac 

gacaacgacgaacgcacttcgacagatccatgggccgagtattgtgtcgcg 

gatagttcaagaaccttccgtgctcgaaacagttcgcttggtaccgaagaa 

gaaaccgatgatgtaagcccgaaatctctgtatttcgctcgcagtaatcggt 

tcgcattcaacgatgatgaaactgaacgggaatggacttcaagatgccaac 

aatcatcgtggagagatacagaggtggatgatgagctgaaaaagcgggaaa 

caacgtctgaaaaatttaccgaaaccacgacgaatggaagtaccaaaacaca 

cacagaatcggatgatagtgaagttgaacggatggaggttgatgatcaagt 

tgatgaagctcaaataactgtatcatcatcaaaagacgatggaatgaatgga 

aatgataagaacgaggatgaagaagatgatgatgatgatatggatgtagatg 

aacatcagactgtcgtgggtgtgcatcagcaccagcagcagcagcatcacc 

agcaaaaagttcggcatcaaatgaatggtggtggtggtggtggtggagtg 

gtaaaactgaaaccgccgctgcaagaactttcgccgccgctttcgggaaac 

ggaagagcggacagagcggaaccgacgccggttccggcaaagatgtgcg 

gaacggtgtcggactcagatgattggagagagccgagtggatcaccatga 

gaatcgaattcatcaaccgaatggggtggctatacgccacaagaacagcat 

gcagttgttgttgccaacgcggtagctgtcgctttcaaggaaaaattgatg 

aatggcgtggatgatgatgatgatcaacaaccatcgccggctagaggagca

CGAGATCATTCCATCAAAGATTCGATGTCAACGGTAACACTGCTGGAACGG
AAAAAGTTCATGATGCeGTCGACAATCGGTCTATAA 






FIGURE 18 



MATTSKAFRARALDSNRSMTVYWGHELPDLSECSVGNRAVTQMPSGMEKEE

poJkhlqeaiaaoqastsgiolnhviptpkvdrvedqryhstyhnknkmhrsk 
vSvhawqalerdepeVdydtedeawlsdhthidprvlekifdtveshssetqi 
IITpdsvinlhksldssivyeiyeywlskrtsaat^ 

rncWINPYVAFRRRAEKMQTRKNRKNDEDSYEKlLKLVHDMSKAQQLFDMTAR 

R?KQKLALIDMESEILAKRMEMSbFGGSPSSFNElTEKI^ 

r^nEVKKRKKPRRKIADKDLISKAWLKKNAESWNRPPSLFGQHSGNVPTVTTK 

PVRESLANGRFAFKRRRGCVYRAALTVYNVPTAPATVPPVQTQAAVASSSSSK 

qTDMVPSNMKFFETFVRDSQDSVSRSLGFVRRRMGRGGRWFDRMPRNRDD 

MnFRTSTDPWAEYCVADSSRTFRARNSSLGTEEETDDLSPKSLYFARSNRFAF 

wnnETEREWTSRCOOSSWRDTEVDDELKKRETTSEKFTETTTNGSTKTHTES 

nnqFVERMEVDDQVDEAQITVSSSKDDGMNGNDKKEDEEpDpppMp 

TWGVHQHOOOQHHOQKVRHOMNGGGGGGGWKLKPPLQELSPPLSGNGR 

InRAEPTPVPAKMCGTVSDSDDWREPSGSPSESNSSTEWGGYTPQEQHAW 

VANAVAVAFKEKLMNGVDDDDDQQPSPARGARDHSIKDSMSTVTLLERKKFM 

MPSTIGL

ssl-1 Genomic

caoctqatgt- tgttgatgga aaaatgacgg ctgcaaagaa gccattggct gcaactgagc 60 
cSaaSgca taataaataa atgtgtttct aggatcttct aataattttt tttctgtttt 120 
rtaactctaa acttgtattt atttcattct tgttctacca aattcccacg gattctacgc 180 
tttltatttc taaattatta ttctttttta tttatatctg cattttcttc taaaaactct 240 
aatcattttc ttgttttttt cttggtaatt ataaaaatta gtcatacaaa tcttgttaaa 300 
tatctqgcta ttcagtgaac ^aaccatttt ccgctctaaa ttcgacccga atcaatcgaa 360 
aaatggctca aaacgatgcc atctggctgc aacccccctg tcgtctctca attttgtgta 420
ctctctcgca gccacgcacg cgacgcaacg cactcgcgtc gcggtcgcag ttctttttca 480 
aatttatcgc gccatttttg ttttgcctca tatttatcgg ctcacgattg attttcgtcg 540 
aaaaacgcgc ttaatcgatt cctttttacc tgaaaaatgt tgttccaatt ggaaaaccag 600 
ttaaaaatcg atgaattttc aagaaaatca ttcaaatagg caaaacccgc tgaactttga 660 
aattcgattt ttgagttttt tgaagaaaat ataattattt catcatttat gttggtcctg 720 
ttggtcctca gcatagaaaa ttcggacatg acattagaaa ttcataataa ctgctcccaa 780
tatcgggatt agaacgattt tcagctcaaa atatggaaaa ttggttacat aaaccgcata 840
tttgtagcat taatcttgaa cagctatatg gcattaaaaa aaaatatata tatacattgt 900
tttttctctc gaagtttctc tttttgtttc taaaatccgg aatataattt aaaaaaccac 960 
ataaatttca atttgcagta cgagttcccc ccgaatcaca atg ccg gca aca ccg 1015
Met Pro Ala Thr Pro
1 5

gtg cgt gct tca agt act cga ata agc aga cgt aca tca tca aga tca 1063
Val Arg Ala Ser Ser Thr Arg Ile Ser Arg Arg Thr Ser Ser Arg Ser
10 15 20

ata gct gat gat cag cca tca act tcg tct gcg gtg gct cca cct cct 1111
Val Ala Asp Asp Gln Pro Ser Thr Ser Ser Ala Val Ala Pro Pro Pro
25 30 35

tca ccc att gcc ata gaa act gat gaa gat gcg gta gtt gag gag gag 1159
Ser Pro Ile Ala Ile Glu Thr Asp Glu Asp Ala Val Val Glu Glu Glu
40 45 50

aaa aag aag aaa aag aca tca gat gat ttg gaa att atc act cca aga 1207
Lys Lys Lys Lys Lys Thr Ser Asp Asp Leu Glu Ile Ile Thr Pro Arg
55 60 65

act cca gtc gat cgg cga att ccc tac att tgc tcg att ctt ttg act 1255
Thr Pro Val Asp Arg Arg Ile Pro Tyr Ile Cys Ser Ile Leu Leu Thr
70 75 80 85

gaa aat cga tcg att cgc gat aaa tt gtacgatttt ttaaatttaa 1301
Glu Asn Arg Ser Ile Arg Asp Lys Leu
90

ttactttcct caaatccgaa taattattag atcgcgcttc gcgtttctgc atccgcggta 1361
ttttgccttc ccactgaaaa tagcagattt atcgaatttt tagcttaaaa aaaaaatgtt 1421
ttttctgcat ttttcaaaca aaccttttgt aaaacagtga aaatcgaatt tcaaatgact 1481
aaaatgaatt ttttttttgt ccactggttg tggaatggtt tgaatttgaa gaaatcagcg 1541
««»t«-tttca tattttetqa atatttttet . attaaaaat^^ 

cttttgaata gaaaaatatt gagaaaatac gaaaaatcca gctaacttcc agcttgttca 1661
aattcaaacc attccacaac cagtggacga aaaaagttca ttttagtcat ttgaaattcg 1721
atttggtttg tttgaaaaat gcaaaaaaaa aatatttttt aaagctaaaa atttgataaa 1781
tctgaaaaaa atctgctatt ttcagtggaa aggcaaaata ccgcgaagcg cagcaagcgc 1841




FIGURE 19 



gctctaataa ttattccgct tcgagaagag cgtgtattat ttcattgtta catttcaaaa 1901 
ttatgaatta atgtttttca g g gtt ctg agc agc ggt cca gtt cgt caa gaa 1953
Val Leu Ser Ser Gly Pro Val Arg Gln Glu
95 100

gat cac gaa gaa cag att gct cga gct caa cgg ata cag cca gtt gtc 2001
Asp His Glu Glu Gln Ile Ala Arg Ala Gln Arg Ile Gln Pro Val Val
105 110 115 120

gat caa att caa cga gtc gag caa at gtatgtgaag ctgaaaaatt 2047
Asp Gln Ile Gln Arg Val Glu Gln Ile
125

gcaccacaaa tcaattattc taatcttgtt ttacag c ata ctc aat ggt tca gtg 2102
Ile Leu Asn Gly Ser Val
130 135

gali gat att ctg aaa gat cct cga ttc gca gta atg gca gat ctc aca 2150
Glu Asp Ile Leu Lys Asp Pro Arg Phe Ala Val Met Ala Asp Leu Thr
140 145 150

aaa gaa cca cca cca aca cct gca cct cct cct cca atc cag aag aca 2198
Lys Glu Pro Pro Pro Thr Pro Ala Pro Pro Pro Pro Ile Gln Lys Thr
155 160 165

atg caa ccg att gag gtg aaa att gag gat tca gag ggc tca aat acg 2246
Met Gln Pro Ile Glu Val Lys Ile Glu Asp Ser Glu Gly Ser Asn Thr
170 175 180

gct caa ccg agt gtt ctg ccc agt tgt gga gga gga gag acg aat gtg 2294
Ala Gln Pro Ser Val Leu Pro Ser Cys Gly Gly Gly Glu Thr Asn Val
185 190 195

gaa aga gcc gcc aaa aga gtgagttttg aagatagatt ggtgtgtaaa 2342
Glu Arg Ala Ala Lys Arg
200 205

aaatgaatgt ttatatattc actgcaactt tttcctcacg agggacgagg aaaagtggtt 2402
tctaggccat ggccgaggtg ccgacaagtt tcagcggcca tttatcttgc tttgttttcc 2462
gcctgttttc tttcgttttt catcgatttt tttcgttttt tcttaataaa actgataaat 2522
aaatattttt tgcagatgct aaaacaattt ccaagtaaaa aaattatgta ttcagtgggc 2582
aagcagcggt gaaagtggtc aatgcaatat gatggattac gggaatacaa aacctaaact 2642
ttttctgaaa catgatacat acgctgctta aatgctgaga ctacctgatt ttcataacga 2702 
gaccgctgaa aaagttttga ggttttcaaa attcaaattt tttggtgaaa aagtcgagat 2762 
tttcgcacaa aaagttgaat tctgaaaacc tcaaattttt ttcagcggtc tcgttatgaa 2822
aatcaggtaa tttcagcatc atatgtatca tgtttcaaaa aaagtttagg ttttgtattc 2882
ccgtaatcca tcatattgca ttgaccactt tcaccgctgc ttgcccactg aatacatgat 2942
tttttacttg gaaattgttt tagcatctgc aaaaaatatt tatttatcag ttttattaag 3002 
aaaaaacgaa aaaaatcggt gaaaaacgaa agaaaacagg cggaaaacaa agcaagataa 3062 
atggccgctg aaacttgtcg gcccctcggc catggcctag aaaccacttt tcctcgtccc 3122
tcgtgaggaa aaagttgcag tgttattgta aatctcacaa gagtctggca tgatttctca 3182 
aaggcgcatg gatttattca gccctaaaat taaataaatc catacgactt taaaggtgga 3242
gttcggsaaa tgaggatttt actttaaaat gctcaaacta gtcccaaatg ccgaattacc 3302
acaaaagaaa aacggaaaaa aattcatcaa gtttgaaaaa aatgcggatg attttgttga 3362 
aatttcaacg ctcgctaata ttcctaattt gaaccgcgct tttgtccgcg ccgcactctg 3422
tagaattgca tccgcgctgt ttccttcctc ttccggcgcc ctacttcttt tcgattggaa 3482
atgatgaaaa aatgagacaa aactagaatt cacgtagcgc gtcggaaatg atgaaaatat 3542
catggatgca gcagatctac ggagtgcggc gcggacaaac ggcgcggtaa ttcaaatgag 3602
gaatattagc gagagttgaa atttcaacaa aatcagccgc atttttttca aacttaatgt 3662 
attttttttc gtttttcttt tgtagtaatt cggcatttgg ggctagtgta agcattttaa 3722 
agtaaaatcc tcattttccg aactccacct ttaaaggtgg agtaccgaaa tttgagactt 3782 
tgctttttta ggcccaaatt ggtccaaaac taccgaattt tgtaatgaga cgttctgaaa 3842 
atttatccaa aaaatgttat ggcggttcaa agttcggcaa aatagggccc attttcagct 3902
aaaatcaaat ttttttttcc aactttttcg gtgtcgcaac gtctggagcc taatttttat 3962
ttattaatca ctttttaata aatattgtag cctttgatta ggcgtttatt cgctgattta 4022 
agtacattta tggtttttgg ggcacaaata aaagtttcat tttatgcccc aaaaaccata 4082 
aatgtactta aatcagcgaa taaacgccta atcaaaggct acaatattta ttaaagagtg 4142
atgaataaat aaaaattagg ttccagacgt tgcgacaccg aaaaagttgg aaaaaatttt 4202 
gattttagct gaaaatgtgc cttattttgc cgcgaacttt gaaccgccat aacttttttt 4262 
gagaaagaaa ttttcagaac gtctcattac gaaattcggt agttttaaac caatttgggt 4322 
ctaaaaagtt tcaaattcca ataaaacata ccaaagtctt gtgaaattac aataaactat 4382 
tcctaaacgt attataatcc attctcaatt cttgcag gaa gcg cat gta ttg gct 4437
Glu Ala His Val Leu Ala
210

cga atc gcc gag ctc cgt aag aac ggc tta tgg tcg aac agt cgt ctg 4485
Arg Ile Ala Glu Leu Arg Lys Asn Gly Leu Trp Ser Asn Ser Arg Leu
215 220 225

cca aag tgc gtc gaa cct gaa cgt aat aaa acg cat tgg gat tat cta 4533
Pro Lys Cys Val Glu Pro Glu Arg Asn Lys Thr His Trp Asp Tyr Leu
230 235 240

ctg gaa gag gtc aaa tgg atg gca gtt gat ttc cga acc gag acg aat 4581
Leu Glu Glu Val Lys Trp Met Ala Val Asp Phe Arg Thr Glu Thr Asn
245 250 255

acg aag cga aaa atc gcc aaa gtt ata gct cac gcc att gcg aaa cag 4629
Thr Lys Arg Lys Ile Ala Lys Val Ile Ala His Ala Ile Ala Lys Gln
260 265 270 275

cac cgc gac aag cag atc gag att gag aga gcc gcc gaa cgg gag atc 4677
His Arg Asp Lys Gln Ile Glu Ile Glu Arg Ala Ala Glu Arg Glu Ile
280 285 290

aag gag aag cga aaa atg tgt gca gga atc gcg aag atg gta cgg gat 4725
Lys Glu Lys Arg Lys Met Cys Ala Gly Ile Ala Lys Met Val Arg Asp
295 300 305

ttc tgg tcg tct acg gat aaa gtt gtg gat att cga gcg aag gaa gtt 4773
Phe Trp Ser Ser Thr Asp Lys Val Val Asp Ile Arg Ala Lys Glu Val
310 315 320

ctg gag tcg agg ctc agg aag gcg aga aat aag cat ttg atg ttt gta 4821
Leu Glu Ser Arg Leu Arg Lys Ala Arg Asn Lys His Leu Met Phe Val
325 330 335

att gga caa gtc gat gaa atg agc aat att gtg caa gaa gga ctt gtt 4869
Ile Gly Gln Val Asp Glu Met Ser Asn Ile Val Gln Glu Gly Leu Val
340 345 350 355

tca tcg tcg aaa tcc cca tca att gca tcg gat cga gat gat aaa gat 4917
Ser Ser Ser Lys Ser Pro Ser Ile Ala Ser Asp Arg Asp Asp Lys Asp
360 365 370

gaa gaa ttc aaa gca cct ggc tct gat tca gaa tct gac gat gag cag 4965
Glu Glu Phe Lys Ala Pro Gly Ser Asp Ser Glu Ser Asp Asp Glu Gln
375 380 385

aca att gca aac gcg gaa aag tca cag aaa aag gaa gat gtt cga cag 5013
Thr Ile Ala Asn Ala Glu Lys Ser Gln Lys Lys Glu Asp Val Arg Gln
390 395 400

gaa gtt gat gct ctt caa aac gag gca act gtg gat atg gat gac ttt 5061
Glu Val Asp Ala Leu Gln Asn Glu Ala Thr Val Asp Met Asp Asp Phe
405 410 415

ttg tac act tta ccg ccg gaa tat ctg aag gct tat ggt ctg acg cag 5109
Leu Tyr Thr Leu Pro Pro Glu Tyr Leu Lys Ala Tyr Gly Leu Thr Gln

420 425 430 435

gag gat ttg gag gag atg aag cgc gag aaa ttg gag gag cag aag gct 5157
Glu Asp Leu Glu Glu Met Lys Arg Glu Lys Leu Glu Glu Gln Lys Ala
440 445 450

cgg aag gaa gct tgt ggt gat aat gag gag aaa atg gag att gat gaa 5205
Arg Lys Glu Ala Cys Gly Asp Asn Glu Glu Lys Met Glu Ile Asp Glu
455 460 465

gttcgtagga tgctcctaaa aaaattacct aaaaaaaatc gattttccct ggaaaaaatc 5265
ctctggaaat gacccgaaac gtcatggcgg ctcgaaattt tgaaaaaaaa aaccccccaa 5325
atttccagct aaaatctcaa attttattgc atattttggt agttcttttg ttgtccgagg 5385
tgcgtttttc agctgaaaat gtacctgaat ctgcaagtaa acgaccaata tatgcaataa 5445
atgatgataa ttaatttccg atactgaaat gtgggcgaaa tttgagattt cgactgaaaa 5505
cgtcttaaaa atcacccaaa acccggcttt accgcacgaa ggtttgaaga aaatggccaa 5565
tttttagcca aaatctcaaa tttcgtccac ttttcagtca gaaattagtt ttttgaaatt 5625
aattaacacc ttttattgca tattttcgtc gtttattcgt tgatcgaggt gctttttcgg 5685
tcgatgggtg cacaaattcg gtaattgtgc atccatcggc tgaaaatgct ccagaatttg 5745
cgaatgaacg gtgaaaattt aagattttag attgaaataa gccgtttttt agagaaaatt 5805
ggtcgttttg agacattaaa ttcaatttaa atcccctctt tattttcag agc cca tca 5863
Ser Pro Ser
470

tea gat get caa aag cct tec acc tea age tea gat etc ace gee gag 5911 ^ 
Ser Asp Ala Gin Lys Pro Ser Thr Ser Ser Ser Asp Leu Thr Ala Glu 
475 480 485 



eag ctt caa gat cca aca get gaa gae ggc aac ggt gat ggt cat ggt 5959 
Gin Leu Gin Asp Pro Thr Ala Glu Asp Gly Asn Gly Asp Gly His Gly 
490 _495 500 

gta ctt gaa aac gtg gat tac gtg aag etc aac agt cag gat agt gat 6007 
Val Leu Glu Asn Val Asp Tyr Val Lys Leu Asn Ser Gin Asp Ser Asp 

" 505 Sro - 515 

gaa cga caa caa gag ttg gcg aat ate gca gaa gaa gcg ctg aaa ttc 6055 
Glu Arg Gin Gin Glu Leu Ala Asn He Ala Glu Glu Ala Leu Lys Phe 
520 525 530 

cag cca aaa gga tat aca ctt gag acg aca caa gtc aag acg ccc gta 6103
Gln Pro Lys Gly Tyr Thr Leu Glu Thr Thr Gln Val Lys Thr Pro Val
535 540 545 550

nnn ttc ctg att cga gga caa ctg aga gaa tat caa atg gtt gga ttg 6151
Pro Phe Leu Ile Arg Gly Gln Leu Arg Glu Tyr Gln Met Val Gly Leu
555 560 565

aat tgg atg gtt aca ctt tat gag aag aat ttg aat gga att ctt gcc 6199
Asn Trp Met Val Thr Leu Tyr Glu Lys Asn Leu Asn Gly Ile Leu Ala
570 575 580

gac gag atg ggc ctg gga aag acg att caa acg att tcc ctg ctg gct 6247
Asp Glu Met Gly Leu Gly Lys Thr Ile Gln Thr Ile Ser Leu Leu Ala
585 590 595

cat atg gct tgt agt gaa tcg att tgg gga cca cac ttg att gtt gtg 6295
His Met Ala Cys Ser Glu Ser Ile Trp Gly Pro His Leu Ile Val Val
600 605 610

ccg acg tct gtc att ctg aat tgg gag atg gag ttc aag aaa tgg tgt 6343
Pro Thr Ser Val Ile Leu Asn Trp Glu Met Glu Phe Lys Lys Trp Cys
615 620 625 630

ccg gct ctg aag att ttg acg tat ttt ggt acg gcg aag gag cgt gcc 6391
Pro Ala Leu Lys Ile Leu Thr Tyr Phe Gly Thr Ala Lys Glu Arg Ala
635 640 645

gag aag cgg aag gga tgg atg aag ccg aat tgt ttc cat gtg tgc atc 6439
Glu Lys Arg Lys Gly Trp Met Lys Pro Asn Cys Phe His Val Cys Ile
650 655 660

aca tca tac aag acg gtt act caa gat att aga gct ttt aag cag agg 6487
Thr Ser Tyr Lys Thr Val Thr Gln Asp Ile Arg Ala Phe Lys Gln Arg
665 670 675



gtgcgtagaa attttgaaga tttgcggcga atttggcgaa tttgcataat ttttttaaaa 6547
ccaattttac cgataattgc gaaatttttc aattttatac agtggtcgga aattgctata 6607
attagtataa tttttgcaaa aattggtact tttttcgaaa ttttgaacca ccataaaaca 6667
tttttgaaca atttttaaga ggtttaataa cgaaattcgt tcatttgaac acattttggc 6727
gatatgaatc gcccgaaaat gtcccccaat agacctaatt tcttaacaaa aatttaaaaa 6787
aaaatggccc aaaattgtct caaaatttcg aaaaaaaaac cgtaatttca gctgaaatct 6847
caaaatttgc caaattttcc gtctcacgga gatcagaaaa agttttttgc atttttttgt 6907
ggtttatttt agcgttattt cgttaattta gatacatttt agcccaattt ttgcaaaaat 6967
tatactaatt atagcaattt ctgacccctg acaaactttg aaattatcgg taaacttggt 7027
ataaatggtt tttttccaaa tttttaaagc gatattaaag gtggagtacc acaatttgag 7087
gctttgtttt tttttttgga cccaaattgg tccaaaacta ccgaatttcg taatgagacg 7147
ctctgaaaat ttctttctca aaaaaaaagt tacggcggtt caaagttcgc ggcaaaataa 7207
ggcccatttt cagctaaaat caaaattttt tcccaacttc tcggtgtctc aacgcctgga 7267
acctaatttt tatttattca tcacttttta ataaatattg tggtctttga ttgggctttt 7327
attcgttgat ttaagtacat ttatggtcag tggggcacaa aatgtaactt tttttcccaa 7387
agaccataaa tgtactttaa tcaacgaata aacgcccaat caaagaccac aatatttatt 7447
taaaagtaat gaataaataa taattaggtt ccagacgttg cgacaccgag aagttggaaa 7507
atttttttat tttagctgaa taagggcctt attgtctcaa actttgaacc gccataactt 7567
ttttttgaga acgtctcgtt acgaaattcg gtagttttgg accaatttgg gtctaaaaaa 7627
acaaagtctc aaatttcttg ttagagattt tttaaaaatt gatatttttt ttttcag gcc 7687
Ala

tgg cag tac cta att ctc gat gaa gct caa aat atc aaa aac tgg aag 7735
Trp Gln Tyr Leu Ile Leu Asp Glu Ala Gln Asn Ile Lys Asn Trp Lys
680 685 690 695

tcc caa cgt tgg cag gct ctt ctg aat gtc cgt gct cga cgt cgc ctt 7783
Ser Gln Arg Trp Gln Ala Leu Leu Asn Val Arg Ala Arg Arg Arg Leu
700 705 710

ctc ctg acc gga act cca ctt cag aac tct cta atg gaa ctg tgg tcg 7831
Leu Leu Thr Gly Thr Pro Leu Gln Asn Ser Leu Met Glu Leu Trp Ser
715 720 725

ttg atg cat ttt ttg atg cca aca ata ttc tca agt cat gat gat ttc 7879
Leu Met His Phe Leu Met Pro Thr Ile Phe Ser Ser His Asp Asp Phe
730 735 740

aag gat tgg ttc tcg aat ccg ttg aca ggg atg atg gaa gga aat atg 7927
Lys Asp Trp Phe Ser Asn Pro Leu Thr Gly Met Met Glu Gly Asn Met
745 750 755

gaa ttc aat gct cca cta atc gga cga ctt cac aaa gtg ctc cgt ccg 7975
Glu Phe Asn Ala Pro Leu Ile Gly Arg Leu His Lys Val Leu Arg Pro
760 765 770 775

ttt att ctg cgg cgg ctc aag aag gaa gtt gag aag cag ctg cca gag 8023
Phe Ile Leu Arg Arg Leu Lys Lys Glu Val Glu Lys Gln Leu Pro Glu
780 785 790

aag act gag cat att gtg aat tgt tcg ttg tca aag cgg cag aga tac 8071
Lys Thr Glu His Ile Val Asn Cys Ser Leu Ser Lys Arg Gln Arg Tyr
795 800 805

ctg tac gat gac ttt atg agt cgt aga tca aca aag gag aat cta aag 8119
Leu Tyr Asp Asp Phe Met Ser Arg Arg Ser Thr Lys Glu Asn Leu Lys
810 815 820

tct gga aat atg atg tcg gtg ctc aac att gtg atg caa ctc cga aaa 8167
Ser Gly Asn Met Met Ser Val Leu Asn Ile Val Met Gln Leu Arg Lys
825 830 835

tgt tgt aat cat ccg aat ctc ttc gag ccg cgg cca gtt gtt gct ccg 8215
Cys Cys Asn His Pro Asn Leu Phe Glu Pro Arg Pro Val Val Ala Pro
840 845 850 855

ttc gtc gtt gag aag ctt cag ctc gat gtt ccg gct cgt ctc ttt gaa 8263
Phe Val Val Glu Lys Leu Gln Leu Asp Val Pro Ala Arg Leu Phe Glu
860 865 870

att tcg cag caa gat ccc tcc tcc tcc tca gct agt caa att ccg gaa 8311
Ile Ser Gln Gln Asp Pro Ser Ser Ser Ser Ala Ser Gln Ile Pro Glu
875 880 885

att ttc aat tta tcc aaa atc ggc tat caa tct tcc gtt cga tct gca 8359
Ile Phe Asn Leu Ser Lys Ile Gly Tyr Gln Ser Ser Val Arg Ser Ala
890 895 900

Title: RB PATHWAY AND CHROMATIN REMODELING
GENES THAT ANTAGONIZE LET-60 RAS SIGNALING
Applicant(s): Horvitz et al.
Filing Date: September 12, 2003 Serial No.: N/A
Page 45 of 91 Customer No.: 21 559

FIGURE 19

aaa cca ctc atc gaa gag ctt gaa gca atg agc act tat ccg gag cca 8407
Lys Pro Leu Ile Glu Glu Leu Glu Ala Met Ser Thr Tyr Pro Glu Pro
905 910 915

cga gca cca gaa gtt ggc gga ttt cgg ttc aat cgg acg gct ttt gtt 8455
Arg Ala Pro Glu Val Gly Gly Phe Arg Phe Asn Arg Thr Ala Phe Val
920 925 930 935

gca aag aat ccg cat acg gaa gag tcg gag gac gaa ggt gtt atg aga 8503
Ala Lys Asn Pro His Thr Glu Glu Ser Glu Asp Glu Gly Val Met Arg
940 945 950

agt cgt gtt ctg gtgaattttt aggaaaattg agaaaatgat ctaattgttg 8555
Ser Arg Val Leu
955



aattttttaa agaatttatg ggccacaagc cgatttgccg gaaattttga tttttggcga 8615 
tttgccgaaa attttgattt ttggcgattt gccagaaatt ttgatttttg gcaattatcc 8675
gatttgccgg aaattttgat ttttggcgat ttgecagaaa ttttgatttt tggcaattat 8735 
ccgatttgcc ggaaattttg aattttggca attttccgat ttgccggaaa ttttgatttt 8795
tggcaatttg ccgaattgcc ggaaattttg atttttggca atttgccgaa ttgceggaaa 8855 
ttttgatttt tggggatttg ceggaaattt tgatttttgg eaatttgcct atttgtcgga 8915 
aattttgatt tttggeaatt tgeegatttg teggaaattt tgatttttgg caatttgccg 8975 
atttgecgga aattttgatt tttggeaatt tteegatttg ecaaaaattt tgatttttgg 9035 
cgatttgccg atttgecgga aaaaeatttt gtgageeaat tttetegaaa tttgggettc 9095 
aatattttca aattattcca aattttccac tgattcegaa tatctaagta aaaaaaaatt 9155 
ccctgatttt atatttcagc ttaaaatcgc taattttegc gtcagagacg acgtcatgtg 9215 
tcgatttact ggatttttaa tetttgtegg atgetaattt eegtttttca acgagtttcc 9275 
ttcatttcca tcggtttttg acgaagtttt ctttgaaaat atgttcttaa ggtcaattaa 9335
acgttttatt ateaaaaaaa aetagcaaaa ttggetttaa aaacaeattt teacagaaaa 9395 
ctccgacaaa aaeegacgaa aatgaaggaa acccccegtt tgaaaacaga aattagcatc 9455 
tgataaagat taaaatcccg taaatcgaca catggcgtct ggcgtctctg gcacgaaaag 9515 
tegegatttt aagetgacat aeaaaaaaag agggatatat ttttttacga atttttcaca 9575 
tagatattcg aaateagggg ggaaaatttg gagaaatttg agaaaatttc tcagatttcg 9635 
gattaaaaat atteaatttt tgttttetta tattaaaaaa aaattaactt ttataatttt 9695 
tcag cca aaa cca att aat gga aca gct caa cca ctt caa aat gga aat 9744
Pro Lys Pro Ile Asn Gly Thr Ala Gln Pro Leu Gln Asn Gly Asn
960 965 970

tca ata cca caa aat gct cca aat cgt cca caa act tca tgc att cgt 9792
Ser Ile Pro Gln Asn Ala Pro Asn Arg Pro Gln Thr Ser Cys Ile Arg
975 980 985

tca aaa acc gtc gta aat aca gtt cca ctg acc atc tcc acc gat cga 9840
Ser Lys Thr Val Val Asn Thr Val Pro Leu Thr Ile Ser Thr Asp Arg
990 995 1000

agt ggt ttt cat ttt aat atg gcc aat gtt gga aga ggt gtt gtt cgt 9888
Ser Gly Phe His Phe Asn Met Ala Asn Val Gly Arg Gly Val Val Arg
1005 1010 1015

ttg gat gat tca gca cgt atg agc cca ccg ctc aaa cgt cag aag ctc 9936
Leu Asp Asp Ser Ala Arg Met Ser Pro Pro Leu Lys Arg Gln Lys Leu
1020 1025 1030

acc gga act gca acg aat tgg agt gat tat gtt ccg cga cac gtt gtt 9984
Thr Gly Thr Ala Thr Asn Trp Ser Asp Tyr Val Pro Arg His Val Val
1035 1040 1045 1050

gaa aag atg gaa gaa tcg aga aaa aac cag ctg gaa att gtt cga agg 10032
Glu Lys Met Glu Glu Ser Arg Lys Asn Gln Leu Glu Ile Val Arg Arg
1055 1060 1065

cga ttt gag atg att cgt gct ccg att att cca ctg gaa atg gtt gcg 10080
Arg Phe Glu Met Ile Arg Ala Pro Ile Ile Pro Leu Glu Met Val Ala
1070 1075 1080

ctg gtt cga gag gaa att att gca gaa ttt cca cgt ttg gct gtg gaa 10128
Leu Val Arg Glu Glu Ile Ile Ala Glu Phe Pro Arg Leu Ala Val Glu
1085 1090 1095

gag gac gag gtt gtg cag gag agg ctt ttg gag tat tgc gag ttg ttg 10176
Glu Asp Glu Val Val Gln Glu Arg Leu Leu Glu Tyr Cys Glu Leu Leu
1100 1105 1110

gtg caa aggtagaatt ttgaaaatta ttactttgct tttttttaaa ccaaaattgg 10232
Val Gln
1115

cccaaaacta ccgaatttcg taatgagaca ttctgaaagc ttctcaaaaa aaaagttttg 10292 
gecgctcaaa gttcgggaaa ataaggecca ttttcagctg aaatcaaaat tttttccaac 10352 
ttctcggtgt cgcaacgtct ggaactaaaa ttttggaaaa cgagaaattt tccatttttt 10412 
gcaagctgaa aaatcaaagt ttttttttcc tcaaaattgg acaaacaaaa aaattttttt 10472 
ttgaaaattg atcgaaaaaa ttcaaaattt ctataatttt tcgatttttt aaataaaact 10532 
ttcatcattt ttcttccaaa tttagttttc tcgattttaa cttttttcaa aaaaaaattt 10592 
tttaatacga aaaaaattca attttagctc taattctttt ttagacccaa attggtccaa 10652 
aactaccgaa tttcgtaatg agacgttctg aacatttctc aaaaaaaagt tatgacggtt 10712 
caaagttcgg caaaataagg cccattttca tataaaatca aatttttttt ctaacttctc 10772 
ggtgtcacaa egtctggaac ttaattttta tttaattatt acttttcaat aaatattgtg 10832 
gtcttttatt aggcgtttat ttgttgattt aagtacattt atggtcaagt ggggcccaaa 10892 
taaaagttac attttgtgcc cacatgacca taaatgtact taaatcaacg aataaacgcc 10952 
taatcaaagg ccacaatatt tattaaaaag tgttgaataa ataaaaatta ggttccagac 11012 
attgtgacac cgagaagtta aaaaaaattt tgattttagc tgaaaatggg ccttattttg 11072 
ctgaacttta aaccgctata actttttttt gagaaatttt cagaacgtct cattacgaaa 11132 
ttcggtagtt ttggaccaat ttgggtctaa aaaagaatta gagctaaaat tgaattttct 11192 
tcgtattaaa aatttttttt ttgaaaaaag taaaaatcga gaaaactaaa tttggaagaa 11252 
aaatgatgaa aattttattt aaaaaatcga aaaattatag aaattttgat cgattttttc 11312 
gatcaatttt caataaaaaa ttttttgttt gtccaatttt gaggaaaaaa aaaactttga 11372 
tttttcagct tacaaaaaat ggaaagtttc tcgttttcca attttttgat gtggattttt 11432 
atgagaaaaa atatataatg tcacaaaaaa tagattatta tctaaaaatc gaaaaaatta 11492 
aattttccag ttttcaggaa aaaaatcgtt aagaaattgt ttttccatta aaggtggagt 11552 
accgaatttt gagacgctgc ttttttagac ccaaaatggt ccaaaactac cgaatttegt 11612 
aatgatacgc tetgaaaaat tttcaaaaaa aaagttgtga ccgctcaaag ttttggaaaa 11672 
atggcatatt tttagctaaa atctcaaatt ttggcaactt atcggtgtcg cagcggttgg 11732 
aacttaattt ttatttaatt gtcattcatt aatgcatgtt ttggcatttc attatgtgtt 11792 
atttcgttga ttgagatgct ttttgtgcct gcatcgacca aaaaaccatc tcaatcaacg 11852 
"^aataacaca^taataaaatg ccaaaatatg xattaaagga tgataatcaa ataaaaatta 11912 
agtttcaacc gctgcgacac cgctaagttg ccaaaatttg agattttagc taaaaatggt 11972 
ceatttttet aaaactttga gcggtcacaa cttttttttt gagaaatttt cagagcgtct 12032 
cattacaaaa attggtaggt tcggaccaat ttgggtctafi^jj^gcagcg tctc 12092
cggtacttca cctttaaagt tt¥ca'att'ta" aagtataaa aaaattgacg 12152 

aaaaaatttt ttaaaaattt tttcttccga aaaaaaaatt aattttaatt tttgtt aga 12211
Arg

ttc gga atg tac gtc gaa cca gtg ctg acc gat gct tgg cag tgt cgt 12259
Phe Gly Met Tyr Val Glu Pro Val Leu Thr Asp Ala Trp Gln Cys Arg
1120 1125 1130

cca tca tcg tct ggt ctt cca tca tat att cgc aac aat tta tca aat 12307
Pro Ser Ser Ser Gly Leu Pro Ser Tyr Ile Arg Asn Asn Leu Ser Asn
1135 1140 1145

atc gag ctg aat tct cgt tct ctt ctc ctc aac acc tcc act aat ttc 12355
Ile Glu Leu Asn Ser Arg Ser Leu Leu Leu Asn Thr Ser Thr Asn Phe
1150 1155 1160 1165

gat acc cga atg tcg atc tca cgt gct ctt caa ttc cca gaa ctc cgt 12403
Asp Thr Arg Met Ser Ile Ser Arg Ala Leu Gln Phe Pro Glu Leu Arg
1170 1175 1180

ctg atc gag tac gat tgt gga aag ctt cag acg ttg gct gtt ctg ctt 12451
Leu Ile Glu Tyr Asp Cys Gly Lys Leu Gln Thr Leu Ala Val Leu Leu
1185 1190 1195

cgt cag ttg tac ctg tac aag cac aga tgt ctg atc ttc acg caa atg 12499
Arg Gln Leu Tyr Leu Tyr Lys His Arg Cys Leu Ile Phe Thr Gln Met
1200 1205 1210

tca aag atg ctc gac gtt ctg cag acc ttc ctt tct cat cac nnn tat 12547
Ser Lys Met Leu Asp Val Leu Gln Thr Phe Leu Ser His His Gly Tyr
1215 1220 1225

cag tat ttc cgc ctc gac ggt acc act ggt gtc gaa caa aga cag gcg 12595
Gln Tyr Phe Arg Leu Asp Gly Thr Thr Gly Val Glu Gln Arg Gln Ala
1230 1235 1240 1245

atg atg gag cgg ttc aac gcg gat ccc aag gtg ttt tgc ttc att ctg 12643
Met Met Glu Arg Phe Asn Ala Asp Pro Lys Val Phe Cys Phe Ile Leu
1250 1255 1260

tcg acg aga tcc ggt ggt gtt gga gtc aat cta acc ggt gct gac act 12691
Ser Thr Arg Ser Gly Gly Val Gly Val Asn Leu Thr Gly Ala Asp Thr
1265 1270 1275

gtg atc ttc tac gat tcg gat tgg aat ccg acg atg gat gct cag gct 12739
Val Ile Phe Tyr Asp Ser Asp Trp Asn Pro Thr Met Asp Ala Gln Ala
1280 1285 1290

cag gat aga tgt cat cgt atc gga cag acg agg aat gtc tcg att tat 12787
Gln Asp Arg Cys His Arg Ile Gly Gln Thr Arg Asn Val Ser Ile Tyr
1295 1300 1305

cga ttg att tcc gag cga aca att gag gag aat att ctg aga aag gca 12835
Arg Leu Ile Ser Glu Arg Thr Ile Glu Glu Asn Ile Leu Arg Lys Ala
1310 1315 1320 1325

aca cag aag cgg cga ctt gga gag ttg gca att gac gag gct nnn ttc 12883
Thr Gln Lys Arg Arg Leu Gly Glu Leu Ala Ile Asp Glu Ala Gly Phe
1330 1335 1340

aca ccc gag ttc ttc aaa caa tct gac agt att cgg gat ctt ttt gat 12931
Thr Pro Glu Phe Phe Lys Gln Ser Asp Ser Ile Arg Asp Leu Phe Asp
1345 1350 1355

gga gag aat gtg gaa gtg act gct gtg gca gat gtt gcg acg acg atg 12979
Gly Glu Asn Val Glu Val Thr Ala Val Ala Asp Val Ala Thr Thr Met
1360 1365 1370

agc gag aaa gaa atg gag gtt gcg atg gca aag tgt gaa gat gaa gct 13027
Ser Glu Lys Glu Met Glu Val Ala Met Ala Lys Cys Glu Asp Glu Ala
1375 1380 1385

gat gtg aat gcg gcg aag att gcg gtg gcc gag gcg aac gtt gat aat 13075
Asp Val Asn Ala Ala Lys Ile Ala Val Ala Glu Ala Asn Val Asp Asn
1390 1395 1400 1405

gcg gag ttt gat gag aaa tca ttg ccg ccg atg agc aat ttg caa gga 13123
Ala Glu Phe Asp Glu Lys Ser Leu Pro Pro Met Ser Asn Leu Gln Gly
1410 1415 1420

gat gag gag gct gat gag aag tat atg gag ttg ata caa c aggtaaaatt 13173
Asp Glu Glu Ala Asp Glu Lys Tyr Met Glu Leu Ile Gln
1425 1430

cggcggaaat cggaaatttt cccatttaga atatcaaatt ttgcccgatt gtgtcgtttt 13233
ttgatttttc gatttattcg atttgttttt gagggaaaat cggaaaaatg ttcagaaaat 13293
taaccataac atgtgatctt tttaaaatct tagcgcaaat gtcttctaaa aaataaagaa 13353
tgaccaaaaa ttttaagcta atttttgaaa aaccaaagaa aaaatttaga tttttcgatg 13413
ttttccgaga caaaaagaca aaaacggaaa ttgtcgaaaa tgaatgaaat ttttaatttt 13473
tcagcaaaaa aaaaatagta cttaatttta aaaaatgtga tcatttcggt aggaaaatct 13533
ggaaaaatcg attttcaaac aaaaaaaaac cgagcctcta caatcttttt ttttcccgaa 13593
atctccagaa cttctcacaa taacaactat ataaatttca aaatttc ag ctc aaa 13648
Gln Leu Lys
1435

cca atc gaa cga tat gcc att aac ttt ctt gag aca cag tac aag cca 13696
Pro Ile Glu Arg Tyr Ala Ile Asn Phe Leu Glu Thr Gln Tyr Lys Pro
1440 1445 1450

gaa ttt gag gaa gaa tgc aaa gag gca g aggtatatta ttccattcat 13744
Glu Phe Glu Glu Glu Cys Lys Glu Ala
1455 1460

ctgacttttt tttttttttt ttaaatttaa atttcaccaa attaattac ag gct ctt 13801
Glu Ala Leu
1465

atc gac caa aaa cgc gaa gaa tgg gac aaa aat ctc aac gat acc gcc 13849
Ile Asp Gln Lys Arg Glu Glu Trp Asp Lys Asn Leu Asn Asp Thr Ala
1470 1475 1480

gtc att gac ctc gac gat tcg gat agt ctg ctg ctc aac gat cct tcg 13897
Val Ile Asp Leu Asp Asp Ser Asp Ser Leu Leu Leu Asn Asp Pro Ser
1485 1490 1495

act tct gcc gat ttt tat cag agc tca agt ctt tta gac g aggtacgcga 13947
Thr Ser Ala Asp Phe Tyr Gln Ser Ser Ser Leu Leu Asp
1500 1505 1510
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tratcatcgt cgcagcagca gccttctcca aaaagccgct caaaaaccgg caaaaaagcc 14 007 
tcaaaacttc caaattcgtg ctcgctcccc gtctaagcgt aaatctcagg ctccttcctt 14067
cgatccatat gtttcgtacg caccgcacgc gctcgcttct cccccggatt ccccgcgtaa 14127
gagaagatca cgtggtgcgc gtagtttagg tagtggtggt ggtggtggtg gtggtagtag 14187
atctgttgga agacctgccc gccgatcagt gaagaaagaa gaatcagatg atgatgatga 14247
ggattattgc caagaagagg aagtgaagcg aaatccggca gaaaaggtcc cgccgaaaag 14307
aaaacgagtt gtgtttgtgg aacctccaga ggtgaagccg ccggagccga aaaaacgagt 14367
tgttgttcct gctccatcat catcatcatc agctctaact actcttccac aacaaggacc 14427
gctgatttcg ttgccaaaag ctgtgccagt tgtacctcgg ccccaacaac aagcaccacc 14487
acagctcatc aaaaagcacc agcagactct gatgcctgtg aaggtgctca agattagtgg 14547
tggtggtggt ggtactccag gaccatccag tgtatcgcca ggtccatcaa tcctccgaag 14607
aaccgttgtt ccaggcatag gcgctggtgg tgttggacgc ctaccgcttg tcagaatgcc 14667
tgttcgccct ccatttcctg gctcgcaagc tcctgctcca ccgctgagaa gtggtgttgc 14727
tccaacagct cctgcagcag ctccacgcca gttcgtcgtt ccgtcgtcga gagttcgagt 14787
tatcacgacg agaactccgg tcgccaccac catggtgcaa caacaacaaa gcccgagccc 14847
gttgatgttt ccagtccggg ttgtgcaaag gcccgggcca tctggaccac caccacctgg 14907
acctccagat cgcccaggat ttggaatcta tgagaagccg agattctcac ttggatcacg 14967
aagaagccgt ggagattcgg gcccggaaga tccggcgcca ccacagccac caccacccac 15027
cacttctagg ccaccgccac aagcctaggc gctaggattt tccttttttt tttgttgatt 15087 
tttgctcttt ttttgctctc tcatgatttt ataatctcat tttgctttaa tatttccatt 15147 
tttttggatg tgtggaattt ttttttttga aaatcgggaa aaaacgaaaa atttgaactt 15207 
tttggtgatt ttcagagaaa aatccgtttt taaatgaaaa aatcggaata attcagattt 15267
ttcgaaaaaa aaaaccgaga aaatttcaaa ttttcagttt tttttttcaa aaaatcgaaa 15327 
aaaaaagtaa attttcagaa ttatcagcca agtttttgcg attttttgaa aaatttcaat 15387 
ttttggcaat ttttgggaaa aaatcaattt ttaattcaga aaattggaaa aattaagatt 15447 
tttcgaaaaa aaaaacgaag aaagtttcaa atttttagct tttttcaaaa aatcgaaaat 15507 
cggaattttt ttaatttttc gaataaaaaa aatcgaagaa attccaaaac tttgcgtttt 15567
ttcttgaaat tatctgaaaa ccggaatttt ttttcaaaat tcgccatttt ttgcgaattt 15627 
ttgtaatctt tttccgagaa aactcgattt tttaaatctt aataattcag atttttcgat 15687 
tttcttttgt tccaaaaagt caaaaaccga acaattattt atttcaaaaa ctctaaaaat 15747 
tttcaatttt ttggaaattt tcgggtataa aaaaaaccca tttttaaatc aaaaaatcgg 15807 
aaatttttgt gatttttcga tttttttcac tccaaaaaaa ttccacacag caaaaaataa 15867 
actccgcgca tttttgagcg cacctttcaa tgttttaatt cttatcacga cgtcaaaatt 15927 
cggttatttt tcacacacac acattttcct cccgagcggt tctttttttc atgagttctc 15987 
ccatgttttg tttttatatt tgagacattt ttttttgttg ataagtttca acttcttctt 16047 
cttcttctga ctataaacgt ttttctccat gttttttgcc tgttttctgc cgattttttg 16107 
acacccaaaa ttttttttca ttttcgctcg aaaatgcacg tcgttggctc tagctttgg^ 16167 
aagtttttaa cactgatttt ctggtttttt tttttttttg cagaattttt cagagatagg 16227 
gggctcattc cagcagggtt tcccactata tttcgcattt tttccaaaaa tttttgtatt 16287 
ttcaaaaatt tccaaaaaga aaggggtttt ctttaccaaa tttttctcgc cacttttggc 16347 
ttaattttgg ctttagagat tcgatcgaaa aaattgcgaa agtggcgaga aatctcactg 16407 
gtttgatgtt tgacccccta ctatagaaaa tttgaaaaaa aaaaaaaaaa aaaaaaacta 16467 
gacgaaattt gtggaaatct tgctggagtt tgacgagtcg atggtggatt tttcttgaaa 16527 
cgaatgaaac ggtgattttg gatcggagaa atatggcgaa aaatggtgag aaatgacgag 16587 
gaggaggaag aagctgaaaa tctggaggaa caaaaattgt gtggaagtct cgggaagaaa 16647 
ttagaattga aattttaaag tgttctgaga attttttgtg tgaaattttt ttaaatctgt 16707 
agatcaaata tcaaaaaaaa aaatcagaac tattacgtgt ttatccacaa agatgagaaa 16767 
aa~tcgccata" tctggqgcgc -aaatgaaccc gcgggaagag acaaaactac tgtagttttt 16827 
aaccaatttg tgtaga'ttta cgagctattg cgtcatcgaa ttgaatttaa ttttcaggcg 16887 
tttcacacgt ttttatattg aaatttatct atttattgaa tcaatcttaa aagaaaacac 16947 
_?n;^aaatttt. tt.t t.aaaaat .„tggggct.caa aattaaattc aattcgatga c gcaa tagct 17007 
cgtaaatcta"^ cacaaattgg tta'aaaacta 'cagtlTgtttt" gtctcttc^^^ gcgggttcat '17067 
ttgcgcgcca gatatggtga tttttctcat ctctggataa acacgtaata acatttctcg 17127 
gcacaataaa tttttgctga aacaagtgcg cgcctttgaa gagtactgca atttcaaaca 17187 
cggttttttg gttggaaagc acagtacttt ttcaaaggtg cacaccttct cgaatttctc 17247 
ttcgtgtcga gaccaagaat gccatttttc gatttttaaa aaatcaaaaa aaaaattacc 17307
tttttaaagg tggagtaccg aaatttgaga ctttgttttt ttcggcccaa aatggtccaa 17367
aactaccgaa tttcgtaatg agacgttctg aaaatttctc aaaaaacaac gttatggcgg 17427
tttaaagttc agcaaaataa ggcccatttt cagctaaaat caaaattttt tcccagcttc 17487
tcggtgtcac aacgcctgga acctaatttt tatttattca tcactttttg ataaatattg 17547
tggtctttta ttaggcgttt attttattga tttaagctta tttatggtct ttgtggcgtt 17607
acattttgta ccctaaaaac cataaatgta cttaaatcaa cgaataaacg cctaatcaaa 17667
ggctacaata tttagtagaa agtgataaat aaataaaaat taggttccag acgttgcgac 17727
accgagaagt tggcgaaaac tttgatttta gctaaaaata agccattttc ccaaaacttt 17787
gagcggtcat aacttttttt tgagaaagaa attttcagaa tgtctcatta cgaaattcgg 17847
tagctttggg ccattttggg ccgaaaaagc aaagtctcaa atttcagcac tccaacttta 17907
gcctttacct tggtgaaatt ttttaatctg tagtatactt tatttttggc cgactttttg 17967
aacacaaatt cggtgttagt ttaaaaaaac aatcaaaact aacatattat ccagacgcga 18027
aatttttgtc ggttttcttc gcgccaaaaa gtacggtaac aggtttcggc acgatacatt 18087
tttgttaaaa ggtgctgctc ctttgaagag tgtctaataa ttttcaactt tcgtttctgt 18147
tggaattttc ttcaattttt catagatgtt ttcgatgaaa caaaaaatta acacaaaatc 18207
gtcgtgtcga gacccgaaaa aattttgcgt ctgtgcaaca aacccggaaa attaaagtag 18267
catattgatc caaattgccg atttgccgga aattttgatt ttcggcaata taccgatttg 18327
ccggaacatt tgattttctg gaatataccg atttgccgga atttttggtt ttcggaaatt 18387
tgccggaaat ttagaattcc ggcaatatgc cgatttgccg gaaattttga ttttcggcaa 18447
tatgccgatt tgccggaaat tttgattttc ggcaatatac cgatttgccg gaacatttga 18507
tttccggcaa tatgccgatt tgccggaatt tttgatttcc ggcaatatgc cgatttgccg 18567
gaaattttga ttttcggcaa tataccgatt tgccggaaca tttggtttcc ggcaatatgc 18627
cgatttgccg gaatttttgg ttttcggaaa tttgccggaa atttagaatt ccggcaattt 18687
gccgatttgc cggaaatttt gatttccggc aatatgccga tttgccggaa attttggttt 18747
tcggaaattt gccggaaatt tagaattccg gcaatatgcc gatttgccgg aaattttgat 18807
ttccggcaat atgccgattt gtcagaagaa atcgtttgtc acccacacgt gtattgattt 18867
gatttttct ag ata aaa ttc tac gac gag ctg gac gat atc atg cca atc 18917
Glu Ile Lys Phe Tyr Asp Glu Leu Asp Asp Ile Met Pro Ile
1515 1520

tgg ctt cca cca tca cca cca gat tcg gat gcg gat ttc gac ttg aga 18965
Trp Leu Pro Pro Ser Pro Pro Asp Ser Asp Ala Asp Phe Asp Leu Arg
1525 1530 1535 1540

atg gaa gat gat tgt ctc gat ctg atg tat gaa att gaa caa atg aac 19013
Met Glu Asp Asp Cys Leu Asp Leu Met Tyr Glu Ile Glu Gln Met Asn
1545 1550 1555

gag gct cgc cta cca caa gtt tgt cat gaa atg aga cgt ccg ttg gct 19061
Glu Ala Arg Leu Pro Gln Val Cys His Glu Met Arg Arg Pro Leu Ala
1560 1565 1570

gaa aaa cag cag aaa cag aac acg ttg aat gcg ttt aa tggtaatatt 19109
Glu Lys Gln Gln Lys Gln Asn Thr Leu Asn Ala Phe Lys
1575 1580 1585

ttcaaaaaaa aatttttttg aaaaaattca attaaattcg attttgagca atttttatcg 19169
tgaagattgc ataattttga gattttgcgc caagattttt gttaaattga aaaaaagaga 19229
tgtgcgcctt tatggagtac tgtagttttg aaaattgaaa ttacagtact ctgtttaaag 19289
gcgcacacat gtattacgta gcgaaaagaa aagtacagta attagttaaa taagactact 19349
gtagcgcttg tgtcgattta cgggctctga attttatatg aatttttgaa aactagaaac 19409
atctcaaatt gcataaaatt accatttgaa cctcccgcca agtgattttg ttcgacgggg 19469
tgcgcttgca cgttttctat tttaatttaa ttcaattttt ttt^cttaat tctc 19529
ttttcatgtt ttcagtttga ttttgatgga aatttggaga caatatcaac ataaatgctt 19589
ttcaatcgaa aatgtgcatt tatattgaca ttttctccga atttccatca aaattaaact 19649
gaaaacacga aaaatcggtg agaattaagc gaaaaaattg agttaaatga aaatagaaaa 19709
cgtgcaagcg cgctccatcg aacaaaatca attggcggga ggttcaaatg ggaattgtat 19769
gcaattttca aaaggtcgta taaaattttg aagaaagcaa attaaattta aaaaatcgag 19829
ctcgtaaatc gacacaggcg ctaattttca aaaaaataaa atgacaccca aaaaatcata 19889
agaaaatcat aaataaatat tacgggaaca caaaactcag agaacccgta ttgcacaaca 19949
tatttgacgc gcaaaatatg aaatatctcg tagcgaaaag aaaactaccg taatttaaaa 20009
acatttaaat gactactgta gcgcttgtgt cgatttacga gatctcgatt ttctaaataa 20069
attttttaaa aaatgatgtc agcgatattc catttgactt tgtttcttcg tattattttc 20129
tcatttttgc ttgattttat ttaattttat aattttattt aaaatcaagc aaaaacgaga 20189
aaataatacg aagaaacgga gttaaatgga atatcgctga cataatttaa aaaaaaaatt 20249
taattagaaa atcgagatcc cgtaaatcga cacaagtagt catagtacag tagtcattta 20309
actaattact gtacttttct tttcgctgcg agatatttca tatttttatt catattttta 20369
tttattttca tatttttata tatatatata tatatatatt tcttggcgtt ctaatgcagt 20429
ttctctcaat taattcc a gac att cta tcg gca aaa gaa aag gaa tcg gtg 20480
Asp Ile Leu Ser Ala Lys Glu Lys Glu Ser Val
1590 1595

tac gat gcg gtc aac aag tgc ctt caa atg cca caa tcc gaa gcg atc 20528
Tyr Asp Ala Val Asn Lys Cys Leu Gln Met Pro Gln Ser Glu Ala Ile
1600 1605 1610

aca gca gaa tct gca gcg tct cca gca tac acg gaa cac tca tca ttc 20576
Thr Ala Glu Ser Ala Ala Ser Pro Ala Tyr Thr Glu His Ser Ser Phe
1615 1620 1625

tcg atg gat gat aca agc cag gat gcg aag att gag cca agt ttg act 20624
Ser Met Asp Asp Thr Ser Gln Asp Ala Lys Ile Glu Pro Ser Leu Thr
1630 1635 1640

gaa aat caa caa ccc acc acc acc gcc act act act act aca gta ccc 20672
Glu Asn Gln Gln Pro Thr Thr Thr Ala Thr Thr Thr Thr Thr Val Pro
1645 1650 1655 1660

caa caa caa caa caa cag cag cag caa aaa tcg tcg aaa aag aag aga 20720
Gln Gln Gln Gln Gln Gln Gln Gln Gln Lys Ser Ser Lys Lys Lys Arg
1665 1670 1675

aat gat aat cga a cggtacggag gttactagcg aacaatttca agaaattttg 20773
Asn Asp Asn Arg
1680



aatttgtgaa aattcaattc cggcaatttt tcgatttgcc ggaactttta attttcgccg 20833
aattgtcaat ttgccggaaa ttttgatttc cgccgaattg tcgatttgcc ggaacttttc 20893
attttcggca aattttcgat ttgccggaac ttttaatttt tgacaaattg tcgatgtgcc 20953
ggaaattttg attttcgaca atttgctgat ttgccggaaa tttcaatccc aacaattttc 21013
cgatttgccg gaaatttcaa tcccaacaat tttccgattt gccggaaatt tcaatcccaa 21073
caattttccg atttgccgga aatttcaatc ccaacaattt tccgatttgc cggaaatttc 21133
aatcccagca attttccgat ttgccggaaa tttcaattcc ggcaattttt cgatttgccg 21193
gaacttttca ttttcggcaa agtgtcgatt tgccggaact tttcattttc gccgaattgt 21253
cgatttgccc gaacttttaa tttttgacaa attgtcgttt tgctggaaat tttgattttc 21313
gacaatttgc caatttgccg gaacttttaa tttttgacaa attgtcgatt tgccggaaat 21373
tttgattttc gacaatttgc caatttgccg gaacttttca tttttgccaa attgtcgatt 21433
tgccggaaat tttaattccg gcaattttgc gatttgccgg aaatttcaat tccggcaatt 21493
taaaaacact aaaaaccaaa aattttcggt tttcccgttt ttcgatgttt cagcttttct 21553
caaaaaattg cgattccccg aaaaatcgaa acaattttcg gggttaaaac cgggaaattc 21613
ctaaattcct atttaaaaga attgaaaaaa aactctcaaa attcc ag gct caa aat 21669
Lys Ala Gln Asn






cga aca gct gaa aat ggt gtg aaa cga gcg aca act cca cca cca tca 21717
Arg Thr Ala Glu Asn Gly Val Lys Arg Ala Thr Thr Pro Pro Pro Ser
1685 1690 1695 1700

tgg cgt gaa gag cca gat tat gat gga gcc gaa tgg aat ata gtt gaa 21765
Trp Arg Glu Glu Pro Asp Tyr Asp Gly Ala Glu Trp Asn Ile Val Glu
1705 1710 1715

gat tat gca cta ctt caa gca gtt caa gtc gaa ttt gca aat gct cat 21813
Asp Tyr Ala Leu Leu Gln Ala Val Gln Val Glu Phe Ala Asn Ala His
1720 1725 1730

tta gtc gaa aaa tcg gcg aat gag gga atg gtg ttg aac tgg gaa ttc 21861
Leu Val Glu Lys Ser Ala Asn Glu Gly Met Val Leu Asn Trp Glu Phe
1735 1740 1745

gtg tcg aat gcc gtt aat aag cag aca aga ttt ttc cgc tcg gcc cgt 21909
Val Ser Asn Ala Val Asn Lys Gln Thr Arg Phe Phe Arg Ser Ala Arg
1750 1755 1760

caa tgc tca att cga tat caa atg ttt gtt cgg cca aaa gag ctc gga 21957
Gln Cys Ser Ile Arg Tyr Gln Met Phe Val Arg Pro Lys Glu Leu Gly
1765 1770 1775 1780

cag ttg gtg gct tct gat ccg att tcc aag aaa acg atg aaa gtc gac 22005
Gln Leu Val Ala Ser Asp Pro Ile Ser Lys Lys Thr Met Lys Val Asp
1785 1790 1795

cta tcg cat act gaa tta tct cat ttg aga aaa gga cga atg act acg 22053
Leu Ser His Thr Glu Leu Ser His Leu Arg Lys Gly Arg Met Thr Thr
1800 1805 1810

gag agc caa tat gct cat gat tat gga ata ttg act gat aag aaa cat 22101
Glu Ser Gln Tyr Ala His Asp Tyr Gly Ile Leu Thr Asp Lys Lys His
1815 1820 1825

gtg aat aga ttt aaa agt gtt cga gtg gcg gca aca cgg aga cct gtt 22149
Val Asn Arg Phe Lys Ser Val Arg Val Ala Ala Thr Arg Arg Pro Val
1830 1835 1840

cag ttt tgg aga ggc cct aaa ggt aga gga gga tgg ctt cat aat agt 22197
Gln Phe Trp Arg Gly Pro Lys Gly Arg Gly Gly Trp Leu His Asn Ser
1845 1850 1855 1860

cac tgc aac ttt ttc ctc acg agg gac gag aaa aag tgg ttt cta ggc 22245
His Cys Asn Phe Phe Leu Thr Arg Asp Glu Lys Lys Trp Phe Leu Gly
1865 1870 1875

cat ggc cga ggt gcc gac aag ttt ca gcggccattt atcttgcttt 22291
His Gly Arg Gly Ala Asp Lys Phe
1880



gtt"ttVcg*^"^cgt'tT^^ ^ga^tttte€t" cgttttttct taata'aMcY 22351
gataaataaa tattttttgc agatgctaaa aaaatttcca agtaaaaaaa tcatgtattc 22411
agtgggcatg cagcggtgaa agtgggcatt gtaatatgat ggattacggg tatacaaaac 22471
ctaaactttt tctgaaacat gatacatgtg ctgcttaaat gctgagacta cctgattttc 22531
ataacgagac cgctgaaaaa gttttgaggt ttccaaaatt caactttttt QQtqaaaaag 22591






tcgagatttt cgcacaaaaa gttgaatttt gaaaacctca aaactttttc agcggtctcg 22651
ttatgaaaat caggtaattt cagcatctaa gcatcatatg tatcatgttt cagaaaagtt 22711 
taggttttgt attcccgtaa tccatctatt tacattgacc actttcaccg ctgcttgccc 22771
actgaataca taattttttc acttggaaat tgttttagca tctggaaaaa gtatttattt 22831 
atcagtttta ataagaaaaa acgggaaaaa gctgtgaaaa acaaaagaaa acaggcggaa 22891 
aacaaagcaa gataaatggc cgctgaaact tgtcggcccc tcggccatgg cctagaaacc 22951
acttttcttc gtccctcgtg aggaaaaagt tgcagtgata gtctaaaatt cggaggaatt 23011 
ttttaaaatt ggaaaaaatt gttaaatttt ttttttctgg aaattggaaa atcacaaatt 23071 
ttcgattttt gtttgttaaa aaaaaaaaga aaattggcat aataaaacat ttcttttttt 23131 
tttgaaaatt gggaacttct taatatcaga ttttttaagt aagatttttt tgattttccg 23191
gaaattcgga aaacctgaaa attttcaaca tttcaaaata aaaatttccg tttttttttt 23251
ctgaaaatct ccaacaaaaa aaggtcaaat cgtcagaatt attgttggaa gtggcggttt 23311 
ttcacgatta gagttcagta ttttttcttc tgaatttcaa atttgaaaaa aaatcgaata 23371 
aactgtagaa aaatgataga aaattaacaa aaattctgat taaaggtaaa gggaaaatag 23431 
accgtaatga ccgaatataa ctgttgaaaa tatcaacaaa aaaaattctg aattttttgt 23491 
gactttttca atttttcaag aataaaaaaa acgaccgaat aaaatatttg aattcccgcg 23551 
caaatgagtg actggttctg gccaatttac agtcttttta taaaagaaaa aatctagaaa 23611 
aaccggcgaa tttagccaga aaacgcaaaa aattaaaaat gacgtcactc atttgcgcgc 23671 
ggaatacaaa tttaattagg ccgtttcttt gatttttgaa aaattgaaaa aaccattaaa 23731 
aaatttagaa atttttttga attttttaca gttttttatt cggtcattat ggggttattc 23791 
aagtagtgtc ggaaaattaa aaagtgtaga aaaattacgt cacaactctg tattcaagta 23851 
tataaaaaca tgtatttaaa tacattttgc tacattactt gaataacccc attagggttt 23911 
attttcttta gagcaaaaaa aaacatgttt ggctctactc cacctttaaa tgaaaaaatc 23971 
gacaatttgt gattttgcaa tttccagaaa aaaaagaaaa aagttgcttt ttggaaaaaa 24031 
ccaaaaaaag ccatttgaaa aattttattt tccaaaaaaa attattttgc agctctagaa 24091 
tctcgaaatc tgcaatctct aaacggcgga atgccaccac gacacgagtc gagactcgcc 24151 
gaattcgacg taaaaaccaa tattcgcctg gacgccgagg acattgtcac aatgtccgac 24211
gagtcgattg tcgcctatga agcgagcaag aagaagctac tggccagtcg tcaaacaaaa 24271 
ccctcaccac gtcaagatgt ccgattccat acgctggttc ttcggccgta taccgtacct 24331 
gtgacaactg agtactcggc tgcaccttct cgtcgtgaaa tgcgcatcgc tgttccaccg 24391 
cttcagcctt cggctttatc tacgatttcc tcagttgctg ctgctgccac gtctgggcca 24451 
ctaccatcaa ttcagcattt gcagtcgtcg tctacgggct tgggatctca gcaaaatttg 24511 
caaaattcgc ataattctga gcaaagaaat aatgtgcaaa atatgcatca aaatcaatat 24571 
aattcaagtc aaaatccgcc aatacctatc cgacaaatcg gagcagcatc atcacaccaa 24631 
catgatcaag gatctcaggg gcctggggga aaaccacaag cctatcacct ggtgcaacag 24691 
ggatcacagc aacagcagca gcagcagcag caggcgacgt tacagcgaag aaatgcggcg 24751 
gcggcggcag ggtcgaatgt gcagtttatt cagcagcagc agcagcagca gcaatcgggt 24811 
aaaaattgta tggatttata ggaaattata tgaatttgcg cggggatagc cccggcgaaa 24871 
aacgggaaaa agcgacaatt taaaaaaaaa tcgtgtgaaa atctcaattt tttacaattt 24931 
tgaaagtaat tttttattga aaaaagtgga atttaggcat tcatccagag cagggctggg 24991 
accaaaaaaa atttttggac caaaaaccaa aaaacaaaaa attgaaattt ccgaaaaatc 25051 
aacttaagca tcaaaaattt tttgtttttt tttttgtttt ttggtttttt ttggtatttt 25111 
gacgaaaaaa cgattttttg gttttttggt ttttcgagac caaaaaaacc aaaaaatcca 25171 
aaaaaatgtt tgccgtgtct agtctcgacc tagacacggc aaacattttt tttttttgga 25231 
ttttttggtt tttttggtcc cgaaaaacca aaaaaaccaa aaaatcgatt tttcgtcaaa 25291 
ataccaaaaa aaaacaaaga attcccagcc cctttcgcca aaattgccgg atattttcaa 25351 
acctcaaaaa aaatttataa aggtggacta catcctgtgg ggaaattgct ttaaaacatg 25411 
cctatgggct cacaatgacc gaatatcatg attaaaaaat tcaacaaaaa aattactaga 25471 
ttttatgtga ttttttgaaa attaaaaaaa tctcagtttt caacctaatt cctatttgaa 25531 
tttccgccaa tttgatttgt tcgatggagc gcgcttgctt tatttttttt tattcattga 25591 
ttttattttt attagcatta tttcactgat tttcttcatt ttttgtgtgt ttttggtggg 25651 
_££ttgaaatg:_.aaaaaaaaca,.agataaa.tg<:^^ 
cttaaaacgg caacaagctt gaaatttgta tattttacac agttttacgc attttcaatg 25771 
actttttaac aaactttccg catttatctt gtttttttca gttcaatttc cattaaaaaa 25831 
cacacaaaaa aaaatgaaga aaatcagtga aataaggtta ataaataaaa taaatgaata 25891 
aaaatgatgc aagcgcgctc caacgaacga attcaattgg cggaaattca aatatggaat 25951 
taggtgaaaa ctgagatttt tttttcaatt ttcaaaaaat catataaaat ctagaaccat 26011 
tttttgaatt ttttaatcat gatattcggt cattgtgacc ccataggcgt gttttaaagc 26071
aatttcccca cagggtgtag tccacctttg acgaggtttg aaaatgtccg gcaattttgc 26131 
cgaaattgcc ggaaacttga gatttttcag tgaaaaattc caaatttcat gtggaaaact 26191 
gtttttttgt tttttggaaa atgcaacaaa aaaaactatt tggcgcgaaa acgcggatag 26251 
ttttgccaat tttcaaggat tttccgctat ttttaatgtt tttatgccga attttacttt 26311 
aaaaaatcat aattattcgg aaaatgctcg aagagcattt ccaattgtct gtggagcgcg 26371
tttgactaat cagataatat tccaggcggt caaggacaaa gcttcgttgt catgggctcg 26431 
cagagctcat caaatgatgg acaaggtgga gcatcgaccg tcggaggagg aggaggagga 26491 
tcacaacagc ctcaccagca gcagcagcag cagccacaac aaagaataca gtacattcca 26551 
caagttaccg gtagcggaaa taacggtgga ggtggtggaa gaggaggcta cggtagtaca 26611
ctggtcatgc caagaggagg acgtgttgtc agggttggta gaaatacaaa atcgcgaaaa 26671 
aacggcattt ccggcttccc gaccaatcag cgatttgctc cgcccacttt cggaccaatc 26731 
cgctgaccga ggcatttgat tggtttgaaa ttgggcggag cagcgaattg ctgatgcgaa 26791 
atacgggaag ttctcatttt gatggaaatt ctgcaaaatt ctttaaaaaa aacaaaatct 26851 
tctcaaattc ggaaaaaatc acaaaggaaa tcgaagaaaa tcgcgatttt tgattccccg 26911 
accaatcagc gatttgctcc gcccactttt gaaccaatca gcgttcgagg catttgattg 26971 
gttcaaaact gggcggagca gcgagttgct gattggattt ttcagttttt aaatttttaa 27031
agcttttttt aacggaaaaa ttcgagaaaa ccatagattt tgatgagaaa tgatgaaaat 27091 
tttcatgaaa aaatggaaaa atgattggaa attaatcaaa aaatcttgaa aaaaaatttt 27151 
ttttcagaga aaatgcttca tttttggctc tgaaacgcct cttttttatt tgtgcctccc 27211 
cgaccaatca gcaatttgct ccgcccactt ttgaaccaat cagcgaccga gcgatccgat 27271 
tggtttgaaa ttgggcggag ctaaaatgat tttaaaaaaa ttcccgattt gtttaatcta 27331 
gaaatttaga aaaaagaaat atagaaaaaa aatagaaaaa aattaaaaaa aaaaaaacaa 27391 
aaaatcggaa aacgtcggaa aatattacga aaaaaatttt tttaattgat tttttttcga 27451 
aaaaaactaa aattttaacc aaaaattcaa agaaaaaatt tgtttttgat ttttttttcg 27511 
aaaaaaaaaa aaattttaac caaaaattca aaaaaaaaat gtttttcttg atttttttcc 27571 
aaaaaaacta aaattttgac caaaaattca gcaaaaaaaa aattttttaa ttgatttttt 27631 
tttcgaaaaa aaataaaatt ttaaccaaaa attcaaaaaa aaaatttttt attgactttt 27691 
ttcgaaaaaa actcaaattt taaccaaaaa ttcaaaaaaa aaaatttttt ttttgatttt 27751 
ttccgaaaaa aactaaaatt ttaaccaaaa attcaaaaaa aaaatgtttt tcttgatttt 27811 
tttccaaaaa aactaaaatt ttgaccaaaa attcagcaaa aaaaaaattt tttaattgat 27871 
ttttttttcg aaaaaaaata aaattttaac caaaaattca aaaaaaaaat tttttattga 27931 
cttttttcga aaaaaactca aattttaacc aaaaattcaa aaaaaaaaat tttttttttg 27991 
attttttccg aaaaaaacta aaattttaac caaaaattca aaaaaaaatt ttttattgat 28051 
ttttttccaa aaaaactaaa attttgacca aaaattcagc aaaaaaaaaa ttttttaatt 28111 
gatttttttt cgaaaaaaac taaaattttg accaaaaatt caacaaaaaa aaaatttttt 28171 
attgattttt ttcgaaaaaa actaaaattt tgaccaaaaa ttcaacaaaa aaaaattttt 28231 
ccagccagcg ggaactctac caggcggtgg acgtctgtat gtcgatcata accgtcatcc 28291 
atatccaatg tcgtcaaatg ttgtgccagt acgtgttcta ccagccacgc aacaaggaca 28351
acaacgaatg atgacaggac aacgtcgtcc ggctccagcg cccggtactg tcgccgcaat 28411 
ggtgttgccg aatcgaggag ctggtggaat tccgcaaatg cgcagtttgc agtgagtttt 28471 
gcacggaaat tggacgattt tcagcgaaat tttcgggaaa aatggctatt ttgtgtttga 28531 
aattgcgaaa tttcacgatt tcgtcttaaa tacggtgcca acctacccca tgacggtttg 28591 
atctacaaaa aacgcgggaa tttttcacac aaaaatatgt gagacgtctg cacgttctta 28651 
accaatcggt tgaaaactct gccgcatttt tgtagatcta cggtagatca ctgcagattt 28711 
taagagagaa aaataaataa ataatcccac aaggttttta aaattttttt ttcaatcgta 28771 
aaaaatagcg aaaaattgtt tttcgcgtcg agaccctacg cacatttttt tgcaattttc 28831 
gcttcaaaat tacggtaccg ggtctcgaca cgacattttt attgtgtaaa atacacaatt 28891 
ttttggaatt ttcatcgatt cgaatttaaa tatttttaaa tgatttaatt aattcttaac 28951
gaaaaaaaaa aagttcgaaa ctgcagtact ctttaaaggc gcacacatgt atgtatttat 29011 
aaaaaatgtc gtgtcaagac cgtacttttg gctcacaaat tgcaaaatat tgcggaattt 29071 
tttttaattt tagataaaaa aaaacatgaa aaatctatgg aaactaa act tataa^tt taa 29131 
aa aaaaatt ETftlta 

gaggccccaa tgactgaata tcatgattaa aacaatcaaa aaaaattttc tagattttat 29251 
atgatttttt gaaaattgga aaaatcacag ttttcaccta attctttttg aatttccgcc 29311 
aattggatta gttcggtgga gcgcgcttac attattttta attatttatt ttatttattc 29371 
tcgttatttg actgattttc ttcatttttt gtgtgttttc ctcggaaaaa ggaagaaata 29431
aacaagacaa atgcaaaatg tttgttaaaa agtaattgaa aatgcgtaaa actttgatat 29491
tctgagttcc gacgacaaca agcctgaaat tagtatattt cacagttttt ctcattttca 29551 
attacttttt aacaaacatt ttgcatttgt cttgtgtatt tcttccattt tccgaggaaa 29611 
aaacatagaa aatgaagaaa atcggtcaaa taacgagaat aaataaaatt aattttaaaa 29671 
aagatgcaag tgcgctccac cgaacaaatc caattggcgg aaattcaaat atggaattag 29731 
gggaaaactg tgatttttcc cattttcaaa aaatcatata aaatttggaa aatttttttg 29791
aattttttta atcatgatat tcggtcattg gcgccccata ggcgtgtttt aaagcaattt 29851
ccccactgag cgtagtccac atttaatttt ccaaaacagc acatgctaat cctccaagtt 29911
attccagacg aggcagttac accggcggtg gtggtcagca acgaatcaac gtgatggttc 29971 
aaccacaaca aatgcgcagc aacaatggcg gtggagtcgg tggccaagga ggcctccagg 30031 
gtggtccagg aggtccgcaa ggaattcgtc ggccactcgt cggacggcca ctacaacgag 30091 
gagtcgataa tcaggcgccg acggttgctc aggtcgttgt tgctccgccg caaggaatgc 30151 
agcaggcatc acaaggacca cccgtacttc atatgcagag agcggtttcc atgcaaatgc 30211 
cgacgagtca tcatcatcaa ggccaacagc aggctcctcc gcagagctca cagcaggctt 30271 
cgcaacaggc tcccacatcg gattctggga cgagtgctcc gccacgacaa gcaccaccac 30331 
cacaaaacta gaattttccc ctattatcct attttacccc ccaaaactct attaattaaa 30391 
taatttcctt cctatttttt tcttcgtgtg aagattattt gtcccccaac caagggtgtc 30451
ggtttttcga tttttcgacg tttttcaaaa aaatttcgat ttttcgaaaa attagcttca 30511 
tattttggct attactctgc tttttagaag aaatttgtat gttttttctt gaaaatataa 30571 
gcaaaattag atttaaaaaa aatcatattt tatggttaat tttctgaaca tatttttcaa 30631 
ttttcgattt tcacagaaaa acatcgaaga atcgacaaaa tcgaaaaata tgttccgaaa 30691 
attaaccata aaatatgatt ttttttaaaa tctaattgtg attatattta taagaaaaaa 30751 
catacaaatt tcttctaaaa agcagagtaa tagccaaaat atgaagctaa tttttgaaaa 30811 
aacgaaaaat tttcgatttt ccaaagaatc gaaaaatcga aaaatgacac ccttgccccc 30871 
aactatctct gtatattatt catctattat tgattgtttc tttttgttcc tcgaaatttt 30931 
ttgaaattaa agttctcttc cccaccccga tttccgttgc tttattaatc gcgattgatt 30991 
aattgttttt ccataaatcc ccaactattt atctctgtat attattcatt tatattattt 31051 
atcttttatc tgtgtcgatt tacggtatct ccgggccgta tgattttgaa ttctcttctc 31111 
aaataaaatt gtttttcatc taacatttga tacgtgtttt tctgattttt ttgtatatat 31171 
attttccatg tatatatttc tttttctttt ttctttgctc caactttatt ttaaataatg 31231 
cttttttatc aagagatttt ttaaaaaatc gatttttttt aaagccagga attctgaaga 31291 
atcgaaaaaa atggaactat ttttcaaata atgagaaagt tttttttttt caagaaaaaa 31351 
ataataaaat tctgattttt ttaataaaaa tttaataagt ttttgaagat tttcattgaa 31411 
aacatctaaa ctattcgatt tttgatttta aattttgaaa atagaatttt ttaatatatt 31471 
tttttcaaat cgttaaaaag agaatgccgg aattttttaa aaattcttta aatttagaaa 31531
taatcggaaa attttcgatt ataaaacgct gtataaaacg aaaaaaagtg gattttgatg 31591 
aaagaaaaaa ttttcttgta gtttttttca gaaaaaaatt actttttatt ctccattttt 31651 
tgttgttgaa tttttgagaa aaaactcatt ttgaaaaaat cgaatttttt atattttttc 31711 
taatcgtaaa aaaaatttta aaaatgaatt ccggtaattt tttaaaaaat aatattaatc 31771 
tatagttttg tagttaaaaa aatgtttcac ataaaaatct aaaaattttt gattttaaat 31831 
taaaaaaaaa tcgaattttt taaaattttt ttcaaatcgt aaaaaaagaa acaataaaca 31891 
aaagaattcc ggaaaaaaat tatattatga ttataaattt atagttcttt acttttttaa 31951 
aagattttat tttaaaaatt ctaaaatgat cgatttttgg ttttttaaaa taatcaaaaa 32011 
tgtttgattt tttttaaacg tgaaaaaaat gcaaagaaaa tgaaatccgg caaaaattgt 32071 
aatataatta taaatctata cttttgtggt tttttccaat atttctataa attcttgatt 32131 
tttaaaataa tcaaaagttt tgattttaaa ttttggaaaa attgaatttt tgtatatttt 32191 
tctaatcgta aaaaaatttt taaaaaaatc gaaagcggat ttttttctgc tattttgttt 32251 
tttttttgaa aaccggaaaa aataccaaaa attgatagtt tcgaccactc tggctagact 32311 
accaaaattg aatttttttt ttcgaattga gaatggccgt ggtctcatca gtagctagcc 32371
attctctttt tatttcaatt tttaagaaaa aagtctctaa aattttgaaa aaatcgattt 32431 
tttttactta ctttgatact ttttttatat cttttcaaat cttaaaaaac aattttaaaa 32491 
attgaattcc ggaaattttt ttaaataata taaatctata gttttttagt ttttaaaaaa 32551 
"tat^ttttta'^taaaaatcta^aakagttcgg 

tgttttaaat tttgaaaaaa tataaaaaat tcgatttttt caagataaaa aagcgaattt 32671 
tttgaatttt tttcaaatcg taaaaaatgt ctgtagtttt tttaaagact ctcataaaaa 32731 
tctgaaatgt tcgatttttt atttttaaaa taattttaaa aaaattttaa tattttttat 32791 
cgtgcgaatt ttttaccaac tataatttgg aataattttc acjqatctcaa aatatcccac 32851 
aatcgcgcaa atatgccagg aagcaatgaa gattggataa agaaggaggt cgaggaccag 32911
gacaccaacg ccaacagctc gagctccagc atagccgtct cgcgtcagct cgaagggaat 32971 
tctgctgttc ctgacgccat cgaccttctg tcttctcaaa tcaaaagaga agttgaagag 33031 
gaggatgatc gcaacgatga gactggaccc cgttcggagc ccgtggatgt taagccgtct 33091 
ccaaaacgcc caacgaagag gtcagccgag acctggacga cggctcggcg ccaagcaaga 33151 
aacggtctac ggcgggagac ggttcaactc atcgattcgc gtatgtgaat gttggagtcc 33211
gccatccata cgatccacgc catcttgtca tggaaacttc attgaatgaa attaggtaag 33271 
gaattattga aaataattat tatatatcca ttttaattca attttttttt tcagaatcga 33331 
agatttcgaa ataatccagt atcttccgat gcccttcagg acttcgattc ccatgaagct 33391
agtgatcttc gcagtgagaa gtgaagaatc tgccgagaag atccgctcgt taatcgatcc 33451 
ttcgat^tgg atcgcggctt' ttggtggcgg aaccgaaact caaaaattct tgtggagcga 33511 
gctgacggtg gaggatttcg tcaaggcaca cataatggcc agcaggtaag ctttcgaaca 33571 
tacttaattt tttaaaaact aaaattcagc gcaaccgatg acgtgccata tgaggcagcc 33631 
atggcggatc gagaatcgct caaacaagct gtaaatgatg ccagctctct gaaaggcttg 33691 
aaggaggtaa taatttagaa atgacagaaa atgaaccgtg atgacgaaat acatctgtaa 33751 
aaaaattata aaaaattcta agctccgttt ttaatttttt ttttcagtta tattctgtca 33811 
tagcggccta tttctctgga aaaaaaaatc caaaatagcc tcaaattcgg aattatgctt 33871 
cgattttttt tctgcggtag tcctgaattt aagacgattt tgaatttttg tagctgcctt 33931 
tcgccacaat tacgttaaac atttcagagc atgtcgaaag ctggatggag gatcgtgagt 33991
aagatgcgga aagatctcaa tggagcctga tgatcccctt cccagcacac aagacagttt 34051 
taattttgtg tctgtatagt tttatattaa gttttgatga taatgaattt ttttacggtt 34111 
ttatccatca cttggctcga ttgaagctcc tattgtgcag cacacacggc gtgtaaatta 34171 
gtgcatctaa cctaggaaat gcgatttcta ggccatggcc gaggatccga ctagatcttt 34231 
tttgatggtg tttgtacaga gttaaatttc attttggagg gaaattgaag gaaattgaaa 34291 
gagaaattaa tttaataata ttaatttgat ttaaatgacc agaacaaaac aaataaactg 34351 
aatgacaagc caatcgatat tcgtccagac tgggatgatg ttatatgaac tctttcacct 34411 
gaaacattta agttttttta ataaaagagc aagcgcgctc aaacgcgaaa acgctcgatc 34471
cacttaatct ggattttgtg ccgattcatt tatttcaagc tatgctcgtt tttttctgtt 34531 
atgtttcatt aaaaagaccg aaaacataac aaaaagtgcc tgaaaacgaa aaaaaaccgg 34591 
cgacattaat tgaaaaattc aaaactacaa tttcgccgcc aaaacccaac gagacccaaa 34651 
gtttcagcgc ggagcgtttc cacttggccg tggagcgcgc ttgtatataa aaggacttaa 34711 
ttttttaaaa tacttaccgc agttacttcc aatgtatgtc aaattcactc gattctccat 34771 
tgcagggtta ctaaaatatg ctccaaatag ttggcaaggc gttgacttga ataaatcggg 34831
atggttatct tggatgattg cagttcgatt tccttttgta attatgttct aaaaagtcat 34891 
tgtaatcatt taaaagtgga gtagcgccag tggggatttt gtctaaatgc acttattatg 34951 
atccaaaaca accgaatatc atcataaaac actccaaaaa gtttagtttt ttcataattt 35011 
cctgtcaaag ttttggcaaa ttggcaaaat tttgaaaaat gcgagctttt gaggtaattt 35071 
aaggaaatgt cgcatgtttc gacccctaca attatttaat acagataatt taaacaaaat 35131 
taaaacataa aaatgtagaa attttttttg ttttggtcga tttccaaaat tatgagtggc 35191 
aaaaactgag taattgccac tttttgacag taaataaaaa atgttcaaaa ttttttgaaa 35251 
cgttttatca tgatatttgg ccattatggg agcaaatgag tggtttatct attttttcac 35311 
tggcgctact ccacctttaa gcatgtctgc ctcaccataa tcccatttaa tccaacgttt 35371 
cttagatttg gattcgaata tatttgaatg actggaaaat atgttacgtt accattcaat 35431 
gcaccaatat aagtcatttg atcgagaaaa ttcaaatcgg tgagatttgt gtttctgata 35491 
gtcaatgttc cgaataaaaa ttgtaacact cctaatttgg aaacatattt ttcatcttca 35551 
tggtctatta atagatctcc aaggatatac atacatgtat ctgatagttt gctcattgat 35611 
tcaaatgtgc aataaaatga cgcatccaat ggaccaggat ctttgcaaag tttcgcttca 35671 
atgttttcag tagaaattcc aaggttcaat agggcaacta tctcagtaat ggtgacacaa 35731 
aaatcaggat gaaggttttc aaaattgaag tattgccttt tattgtatgt actgtattgt 35791 
a^catactgg tttgctcaac tgtatctata actttctgaa attttatgtc attattttca 35851 
gaaatcgcac taggcaggca agcctgcctt accgtcagaa ttggcagtcc cagtcgaatc 35911 
atttccggat tatcttgtac attcaatgct acactagcta tatccgagtt atattcgata 35971 
gtttgcaggt tttgtaaaaa cgacaaactc tgtagattag tgttccgaat tgcaatagat 36031 
cctcgaatca ttgtgacatt caaaaatgaa tcataatcga aggttgcatt aatattcact 36091
aaatttagac cagaatctag agttttgcat ttggagtact ccttaacatt tgatacatta 36151 
actttttcac catcacatcc tgaaatttga ctatttttat actgttaaaa aattgtttct 36211 
caccacaatc ctttaagttc cctctgacaa tgagctcatt atacatgtgt aaaaagccgc 36271
^oa aaattccaat ttcggattat tctcgattct aatatcacac gcctcgatac 36331
"cZTclTs g c agSg aga^Staga gcacLtggg gtcgtttaat tgtgaattgt 36391 
ttcStgta aacaccltc? gLatctgaa gtttaagaaa aaattaa^ta agttttaatc 36451 
llrlTall<ia tccgtttttg ttgaaagtat caaaaaatta actggagtca gaatgtctca 36511, 
tttcgfttfg atc?tcaaa! aa'tgcglgag ttcagaccta gacatctcgt ctgatttc c 
^;«n?taaqa gcgttctgac gtcacaattt ttctgaaaaa atattcccgc attttttgta 36631 
«?caaat?a laaJgagaca ^cctgacacc acgtggagtt ccttatatac aaaaaagttg 36691 
Itttttcgct cgtgat?ttt cgttgtaaca tcatgaaaaa tccagtgttc tctgcaaacc 36751 
a"aaaa?cc act?ttttgt ttcagccgct ccgcaagcag cttcgtcgag gtcatggcag 36811 
cqgccgagtt tcccactccg ctgaaactcg gcacttaata tatgaacgac taagctagca 36871 
cggcciccat tctaccttac cagcaaaaat gaattcgttc acttacacac atcacacacc 36931 
aSSaagt ttcctttttc tttgtcagct gtaaaaaccg aaaggcttgt cagactagta 36991 
ttctcaatat taaatc 
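The running position labels at the right of each genomic line can be used to spot OCR damage: each label should equal the previous label plus the number of bases on the line. The following is a minimal sketch of such a check, not part of the original filing; the function names are hypothetical, and the two sample lines are copied from the listing above (labels 22831 and 22891, following a line labelled 22771).

```python
def parse_line(line):
    """Split a listing line into its bases and its trailing position label."""
    *blocks, label = line.split()
    return "".join(blocks), int(label)


def check_numbering(lines, start):
    """Return (found_label, expected_label) pairs for lines whose label
    does not equal the running base count. `start` is the label of the
    line immediately preceding `lines`."""
    expected = start
    problems = []
    for line in lines:
        bases, label = parse_line(line)
        expected += len(bases)
        if label != expected:
            problems.append((label, expected))
            expected = label  # resynchronise on the printed label
    return problems


sample = [
    "actgaataca taattttttc acttggaaat tgttttagca tctggaaaaa gtatttattt 22831",
    "atcagtttta ataagaaaaa acgggaaaaa gctgtgaaaa acaaaagaaa acaggcggaa 22891",
]
print(check_numbering(sample, start=22771))
```

An empty list means every label matches the base count; a non-empty list points to lines where either bases or the label were misread.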
ssl-1 Predicted exons:

Exon    Position in genomic sequence (inclusive)
1       1001-1281
2       1923-2027
3       2084-2312
4       4420-5205
5       5855-6487
6       7685-8515
7       9700-10184
8       12211-13165
9       13643-13726
10      13796-13939
11      18879-19101
12      20449-20735
13      21661-22273
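Because the exon coordinates are inclusive, each exon spans end - start + 1 bases, and the spliced length is the sum over all exons. The sketch below is an illustrative consistency check and not part of the original filing. It assumes the exon 9 start, which is partly illegible in the source, reads 13643. Under that assumption the total comes to 5656, which appears to agree with the cDNA listing in Figure 20B, whose last numbered line ends at 5640 and is followed by 16 further bases.

```python
# Exon coordinates (1-based, inclusive) as read from the predicted-exon table.
# The start of exon 9 is partly illegible in the source; 13643 is assumed.
EXONS = [
    (1001, 1281), (1923, 2027), (2084, 2312), (4420, 5205),
    (5855, 6487), (7685, 8515), (9700, 10184), (12211, 13165),
    (13643, 13726), (13796, 13939), (18879, 19101), (20449, 20735),
    (21661, 22273),
]


def exon_length(start, end):
    """Length of an exon given inclusive 1-based coordinates."""
    return end - start + 1


lengths = [exon_length(start, end) for start, end in EXONS]
total = sum(lengths)

for number, ((start, end), length) in enumerate(zip(EXONS, lengths), start=1):
    print(f"exon {number:2d}: {start}-{end} ({length} nt)")
print("spliced length:", total)
```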
Figure 20B

ssl-1 cDNA



tcagtggctg atgatcagcc f^^cttcg tctgcggtgg aaagacatca 180. 

gccatagaaa ctgatgaaga tgcggtagtt 9a39aggaga ^ ctacatttgc 240 

latgatttgg aaattatcac tccaagaact «^9tcgatc 99 9 ^g^ggtcca 300 

?cgattcttt tgactgaaaa tcgatcgatt cgcgataaat tjg 9^9 gccagttgtc 360 
gttcgtcaag aagatcacga agaacagatt 9Ctcgag«c ^« „ tattctgaaa 420 
laJcLattc aacgagtcga 9caaatcata ctcaat9gtt cagtgg^J^ 3,etgcacct 480 
latcctcgat tcgcagtaat 9gcagatctc acaaaagaac ^^^^ ttcagagggc 540 

cctcctccaa tccagaagac aatgcaaccg ^"9^99 f gaatgtggaa 600 

tcaaatacgg ctcaaccgag tgttctgccc agttgtgg 9 9 93 taagaacggc 660 

agagccgcca aaagagaagc 9catgtattg 9Ctcgaatcg 9 9^^^^^ aacgcattgg 720 
ttatggtcga acagtcgtct gccaaagtgc 9tcgaa^ct9 ^cclaaccga gacgaatacg 780 
gattatctac tggaagaggt caaatggatg 9"9"g cgacaagcag 840 

aagcgaaaaa tcgccaaagt tatagctcac g^attgcga 9^^^^^ gtgtgcagga 900 
atcgagattg agagagccgc cgaacgggag atcaaggaga 9 9 ^attcgagcg 960 

atcgcgaaga tggtacggga tttctggtcg tctacggata g^^g^ gtttgtaatt 1020 
aaggaagttc tggagtcgag g^tcaggaag 9cgagaa 9 ^^^^^ gtcgaaatcc 1080 
ggacaagtcg atgaaatgag caatattgtg "agaaggac 9^ ^^^^ tggctctgat 1140 
ccatcaattg catcggatcg agatgataaa gatgaag aaaggaagat 1200 

tcagaatctg acgatgagca gacaattgca aacgcgg « tgactttttg 1260 

gttcgacagg aagttgatgc tcttcaaaac 9J9gcaactg t9g 99^ tltggaggag 1320 
?acactttac cgccggaata tctgaaggct ^atggtctga 9 99 ^ ^gataatgag 1380 
atgaagcgcg agaaattgga ggagcagaag 9=tcggaagg 9 9^^^ cacctcaagc . 1440 ■ 
galaaaatgg agattgatga aagcccatca ^cagatgctc 9 cggtgatggt 1500 

?cagatctca ccgccgagca gcttcaagat "aacagctg a g 99 ^ 1550 

catlgtgtac ttgaaaacgt 9gattacgtg aagctcaaca gtc^gg^^^S ^gatataca 1620 
caacaagagt tggcgaatat cgcagaagaa g^9Ctga I actgagagaa 1680 

cttgagacga cacaagtcaa gacgcccgta ^^^^^^^^^^ agalgalttt gaatggaatt 1740 
tatcaaatgg ttggattgga ttggatggtt acactttatg 9 9^^ ggctcatatg 1800 
cttgccgacg agatgggcct gsgaaagacg attcaaacga ^ tgtcattctg 1860 
gcttgtagtg aatcgatttg ggjaccacac ttgattgttg tg g^^g^^ gtattttggt 1920 
aattgggaga tggagttcaa gaaatggtgt «9gctccg 9 tttccatgtg 1980 

acggcgaagg agcgtgccga g-gcgga-g c'tttLagca gagggcctgg 2040 

tgcatcacat catacaagac gg^tactcaa gj"J^^ 9^ acgttggcag 2100 

cagtacctaa ttctcgatga agctcaaaat atcaaaaact gg g acttcagaac 2160 

gctcttctga atgtccgtgc tcgacgtcgc ^ttctcctga 99^^^^^^ ctcaagtcat 2220 
tctctaatgg aactgtggtc gttgatgcat "tttgacgc aaatatggaa 2280 

gatgatttca aggattggtt ctcgaatccg ttgacaggga tgat9g gg ^340 
?tcaatgctc cactaatcgg acgacttcac aaagtgctcc 9 9 ^tgttcg 2400 

ctcaagaagg aagttgagaa gcagctgcca 9agaagactg ag 9^ aacaaaggag 2460 

ttgtcaaagc ggcagagata c:ctgtacgat g^^"^^^^^ ?ga?gcaact ccgaaaatgt 2520 
aatctaaagt ctggaaatat 9atgtcggtg ^^^^ttcgt cgttgagaag 2580 

tgtaatcatc cgaatctctt cgagccgcgg ^cagttgttg 9 9^ ctcctcctcc 2640 
cttcagctcg atgttccggc tcgtctcttt 9«^«^tt g 9 ^^^^^ atcttccgtt 2700 
tcagctagtc aaattccgga aattttcaat "atccaaaa 99 ggagccacga 2760 

cgatctgcaa aaccactcat cgaagagctt Saagcaatga ^ gaatccgcat 2820 

g?accagaag ttggcggatt tcggttcaat ^9gacggctt ^ 9 9 ^^^^^^^^^^ 2880 
Lggaagagt cggaggacga aggjg"atg tf lf llll ^^^aLctcc aaatcgtcca 2940 
caaacttcat gcattcgttc aaaaaccgtc g^jaatacag "eg 
llllllllll llllllTclc ITsl^lll ^gtfagSrc ?cfc4aac tgcaacgaa. 3120 
. =»-nt-hrcaca acacqttqtt gaaaagatgg aagaatcgag aaaaaaccag 3180
cScfaattg tlTsllllcl atttfagftg !ttcg?gcrc cgattattcc actggaaatg 3240 
cScgctgl tlcUlssl aattLtgca gaatttccac gtttggctgt ggaagaggac 3300 
faglt^gtgc aggaglgict tttggagtat tgcgagttgt tggtgcaaag Jttcggaatg 3360 
Scgtcgaac cagtgctgac cgatgcttgg cagtgtcgtc catcatcgtc tggtcttcca 3420 
^cSatattc gcLcaattt atcaastatc gagctgaatt ctcgttctct tctcctcaac 3480 
acctccacta ftttcgatac ccgaatgtcg atctcacgtg ctcttcaatt cccagaactc 3540 
cgtctgatcg agtac|attg tg|aaagctt cagacgttgg ctgttctgct tcgtcagttg 3600 
tacctqtaca agcacagatg tctgatcttc acgcaaatgt caaagatgct cgacgttctg 3660 
«gaccttcc' t?tctc'atca cggttatcag tatttccgcc tcgacggtac cactggt3tc 3720 
gaacaaagac aggcgatgat ggagcggttc aacgcggatc ccaaggtgtt "gcttcatt 3780 
Sgtcga^ga gltccggtgg tgttggagtc aatctaaccg gtgctgacac Jgtgatcttc 3840 
tacqaltcgg attggaatcc gacgatggat gctcaggctc aggatagatg tcatcgtatc 3900 
qgacagacia ggaa^gtctc iatttatcga ttgatttccg agcgaacaat tgaggagaat 3960 
aftctlagia aggcaacaca gaagcggcga cttggagagt tggcaattga cgaggctggc 4020 
«cacacccg a^tcttcaa acaatctgac agtattcggg atctttttga tggagagaat 4080 
gtggaagtga ctgctgtggc agatgttgcg acgacgatga gcgagaaaga aatggaggtt 4140
IclItglcL ag?gtgaaga tgaagctgat gtgaatgcgg cgaagattgc 99tggccgag 4200 
IcLaclttg a?aatgcgga gtttgatgag aaatcattgc cgccgatgag caatttgcaa 4260 
qgagatlagg aggctgatga gaagtatatg gagttgatac aacagctcaa accaatcgaa 4320 
claStlcca t?aaclttct tgagacacag tacaagccag aatttgagga agaatgcaaa 4380 
gaggcagagg ctcttatcga ccaaaaacgc gaagaatggg acaaaaatct caacgatacc 4440 
Lcltcatti acctcgacga ttcggatagt ctgctgctca acgatccttc gacttctgcc 4500 
gat?tttatc agagctcaag tcttttagac gagataaaat tctacgacga gctggacgat 4560 
atcatgccaa tctggcttcc accatcacca ccagattcgg atgcggattt cgacttgaga 4620 
atggaagatg attgtctcga tctgatgtat gaaattgaac aaatgaacga ggctcgccta 4680 
cclcaagttt gtcatgaaat gagacgtccg ttggctgaaa aacagcagaa acagaacacg 4740 
ttgaatlcgt ?taatgacat tctatcggca aaagaaaagg aatcggtgta cgatgcggtc 4800 
aacaag?gcc ttcaaatgcc acaatccgaa gcgatcacag cagaatctgc agcgtctcca 4860 
Qcatacacgg aacactcatc attctcgatg gatgatacaa gccaggatgc gaagattgag 4920 
ccaagtttia ctgaaaatca acaacccacc accaccgcca ctactactac tacagtaccc 4980 
caacScaac aa?aacagca gcagcaaaaa tcgtcgaaaa agaagagaaa tgataatcga 5040 
acggctcaaa atcgaacagc tgaaaatggt gtgaaacgag cgacaactcc a^caccatca 5100 
tgicgtgaag agccagatta tgatggagcc gaatggaata tagttgaaga ttatgcacta 5160 
ct?caa|cai t?caagtcga atttgcaaat gctcatttag tcgaaaaatc g9cgaatgag 5220 
ggaatggtgt tgaactggga attcgtgtcg aatgccgtta ataagcagac aagatttttc 5280 
cLtcggccc gtcaatgctc aattcgatat caaatgtttg ttcggccaaa agagctcgga 5340 
cagttggtgg cttctgatcc gatttccaag aaaacgatga aagtcgacct atcgcatact 5400 
gaattatctc atttgagaaa aggacgaatg actacggaga gccaatatgc tcatgattat 5460
ggaatattga ctgataagaa acatgtgaat agatttaaaa gtgttcgagt ggcggcaaca 5520
cggagacctg ttcagttttg gagaggccct aaaggtagag gaggatggct tcataatagt 5580
cactgcaact ttttcctcac gagggacgag aaaaagtggt ttctaggcca tggccgaggt 5640
gccgacaagt ttcagc 
ssl-1 protein

<400> 3 

Met Pro Ala Thr Pro Val Arg Ala Ser Ser Thr Arg Ile Ser Arg Arg
1 5 10 15

Thr Ser Ser Arg Ser Val Ala Asp Asp Gln Pro Ser Thr Ser Ser Ala
20 25 30

Val Ala Pro Pro Pro Ser Pro Ile Ala Ile Glu Thr Asp Glu Asp Ala
35 40 45

Val Val Glu Glu Glu Lys Lys Lys Lys Lys Thr Ser Asp Asp Leu Glu
50 55 60

Ile Ile Thr Pro Arg Thr Pro Val Asp Arg Arg Ile Pro Tyr Ile Cys
65 70 75 80

Ser Ile Leu Leu Thr Glu Asn Arg Ser Ile Arg Asp Lys Leu Val Leu
85 90 95

Ser Ser Gly Pro Val Arg Gln Glu Asp His Glu Glu Gln Ile Ala Arg
100 105 110

Ala Gln Arg Ile Gln Pro Val Val Asp Gln Ile Gln Arg Val Glu Gln
115 120 125

Ile Ile Leu Asn Gly Ser Val Glu Asp Ile Leu Lys Asp Pro Arg Phe
130 135 140

Ala Val Met Ala Asp Leu Thr Lys Glu Pro Pro Pro Thr Pro Ala Pro
145 150 155 160

Pro Pro Pro Ile Gln Lys Thr Met Gln Pro Ile Glu Val Lys Ile Glu
165 170 175

Asp Ser Glu Gly Ser Asn Thr Ala Gln Pro Ser Val Leu Pro Ser Cys
180 185 190

Gly Gly Gly Glu Thr Asn Val Glu Arg Ala Ala Lys Arg Glu Ala His
195 200 205

Val Leu Ala Arg Ile Ala Glu Leu Arg Lys Asn Gly Leu Trp Ser Asn
210 215 220

Ser Arg Leu Pro Lys Cys Val Glu Pro Glu Arg Asn Lys Thr His Trp
225 230 235 240

Asp Tyr Leu Leu Glu Glu Val Lys Trp Met Ala Val Asp Phe Arg Thr
245 250 255

Glu Thr Asn Thr Lys Arg Lys Ile Ala Lys Val Ile Ala His Ala Ile
260 265 270

Ala Lys Gln His Arg Asp Lys Gln Ile Glu Ile Glu Arg Ala Ala Glu
275 280 285

Arg Glu Ile Lys Glu Lys Arg Lys Met Cys Ala Gly Ile Ala Lys Met
290 295 300

Val Arg Asp Phe Trp Ser Ser Thr Asp Lys Val Val Asp Ile Arg Ala
305 310 315 320

Lys Glu Val Leu Glu Ser Arg Leu Arg Lys Ala Arg Asn Lys His Leu
325 330 335

Met Phe Val Ile Gly Gln Val Asp Glu Met Ser Asn Ile Val Gln Glu
340 345 350

Gly Leu Val Ser Ser Ser Lys Ser Pro Ser Ile Ala Ser Asp Arg Asp
355 360 365

Asp Lys Asp Glu Glu Phe Lys Ala Pro Gly Ser Asp Ser Glu Ser Asp
370 375 380

Asp Glu Gln Thr Ile Ala Asn Ala Glu Lys Ser Gln Lys Lys Glu Asp
385 390 395 400

Val Arg Gln Glu Val Asp Ala Leu Gln Asn Glu Ala Thr Val Asp Met
405 410 415
Asp Asp Phe Leu Tyr Thr Leu Pro Pro Glu Tyr Leu Lys Ala Tyr Gly
420 425 430

Leu Thr Gln Glu Asp Leu Glu Glu Met Lys Arg Glu Lys Leu Glu Glu
435 440 445

Gln Lys Ala Arg Lys Glu Ala Cys Gly Asp Asn Glu Glu Lys Met Glu
450 455 460

Ile Asp Glu Ser Pro Ser Ser Asp Ala Gln Lys Pro Ser Thr Ser Ser
465 470 475 480

Ser Asp Leu Thr Ala Glu Gln Leu Gln Asp Pro Thr Ala Glu Asp Gly
485 490 495

Asn Gly Asp Gly His Gly Val Leu Glu Asn Val Asp Tyr Val Lys Leu
500 505 510

Asn Ser Gln Asp Ser Asp Glu Arg Gln Gln Glu Leu Ala Asn Ile Ala
515 520 525

Glu Glu Ala Leu Lys Phe Gln Pro Lys Gly Tyr Thr Leu Glu Thr Thr
530 535 540

Gln Val Lys Thr Pro Val Pro Phe Leu Ile Arg Gly Gln Leu Arg Glu
545 550 555 560

Tyr Gln Met Val Gly Leu Asp Trp Met Val Thr Leu Tyr Glu Lys Asn
565 570 575

Leu Asn Gly Ile Leu Ala Asp Glu Met Gly Leu Gly Lys Thr Ile Gln
580 585 590

Thr Ile Ser Leu Leu Ala His Met Ala Cys Ser Glu Ser Ile Trp Gly
595 600 605

Pro His Leu Ile Val Val Pro Thr Ser Val Ile Leu Asn Trp Glu Met
610 615 620

Glu Phe Lys Lys Trp Cys Pro Ala Leu Lys Ile Leu Thr Tyr Phe Gly
625 630 635 640

Thr Ala Lys Glu Arg Ala Glu Lys Arg Lys Gly Trp Met Lys Pro Asn
645 650 655

Cys Phe His Val Cys Ile Thr Ser Tyr Lys Thr Val Thr Gln Asp Ile
660 665 670

Arg Ala Phe Lys Gln Arg Ala Trp Gln Tyr Leu Ile Leu Asp Glu Ala
675 680 685

Gln Asn Ile Lys Asn Trp Lys Ser Gln Arg Trp Gln Ala Leu Leu Asn
690 695 700

Val Arg Ala Arg Arg Arg Leu Leu Leu Thr Gly Thr Pro Leu Gln Asn
705 710 715 720

Ser Leu Met Glu Leu Trp Ser Leu Met His Phe Leu Met Pro Thr Ile
725 730 735

Phe Ser Ser His Asp Asp Phe Lys Asp Trp Phe Ser Asn Pro Leu Thr
740 745 750

Gly Met Met Glu Gly Asn Met Glu Phe Asn Ala Pro Leu Ile Gly Arg
755 760 765

Leu His Lys Val Leu Arg Pro Phe Ile Leu Arg Arg Leu Lys Lys Glu
770 775 780

Val Glu Lys Gln Leu Pro Glu Lys Thr Glu His Ile Val Asn Cys Ser
785 790 795 800

Leu Ser Lys Arg Gln Arg Tyr Leu Tyr Asp Asp Phe Met Ser Arg Arg
805 810 815

Ser Thr Lys Glu Asn Leu Lys Ser Gly Asn Met Met Ser Val Leu Asn
820 825 830

Ile Val Met Gln Leu Arg Lys Cys Cys Asn His Pro Asn Leu Phe Glu
835 840 845

Pro Arg Pro Val Val Ala Pro Phe Val Val Glu Lys Leu Gln Leu Asp
850 855 860

Val Pro Ala Arg Leu Phe Glu Ile Ser Gln Gln Asp Pro Ser Ser Ser
870

Tel Ala Ser Gin He Pro Glu lie Phe Asn Leu Ser Lys lie Gly Tyr 



885 



890 



Gin ser Ser Val Arg Ser Ala LyB Pro Leu lie Glu Glu Leu Glu Ala 

900 905 
Met ser Thr Tyr Pro Glu Prd Arg Ala Pro Glu Val Gly Gly Phe Arg 

915 920 925 

Phe Asn Arg Thr Ala Phe Val Ala Lys Asn Pro His Thr Glu Glu Ser 

930 9'35 940 

Glu ASP Glu Gly val Met Arg ser Arg Val Leu Pro Lys Pro He Asn 

950 

Glv Thr Ala Gin Pro Leu Gin Asn Gly Asn Ser lie Pro Gin Asn Ala 

' 965 970 

pro Asn Arg Pro Gin Thr Ser Cys He Arg Ser Lys Thr Val Val Asn 

980 985 990 

Thr val Pro Leu Thr lie Ser Thr Asp Arg Ser Gly Phe His Phe Asn 

•- - 995- -- 1000 " - — ■ 1005 „ 

Met Ala Asn Val Gly Arg Gly Val Val Arg Leu Asp Asp Ser Ala Arg 

1010 1015 1020 

Met ser Pro Pro Leu Lys Arg Gin Lys Leu Thr Gly Thr Ala Thr Asn 

nn-je; 1030 ^^^^ " 

i« ser ASP Tyr Val Pro Arg His Val Val Glu Lys Met Glu Glu Ser 

- - 1050 lw5o 



1045 



Arc Lys Asn Gin Leu Glu He Val Arg Arg Arg Phe Glu Met lie Arg 
'^^ ^ 1060 1065 1070 

Ala pro He He Pro Leu Glu Met Val Ala Leu Val Arg Glu Glu lie 

1075 1080 1085 

lie Ala Glu Phe Pro Arg Leu Ala Val Glu Glu Asp Glu Val Val Gin 

1090 1095 1100 

Glu Arg Leu Leu Glu Tyr Cys Glu Leu Leu Val Gin Arg Phe C31y Met 

1105 1110 11" 1120 

Tvr val Glu Pro Val Leu Thr Asp Ala Trp Gin Cys Arg Pro Ser Ser 
^ 1125 1130 1135 

ser Gly Leu Pro Ser Tyr He Arg Asn Asn Leu Ser Asn lie Glu Leu 

1140 1145 1150 

Asn Ser Arg Ser Leu Leu Leu Asn Thr Ser Thr Asn Phe Asp Thr Arg 

1155 1160 1165 

Met ser He Ser Arg Ala Leu Gin Phe Pro Glu Leu Arg Leu He Glu 

1170 1175 1180 

Tyr ASP cys Gly Lys Leu Gin Thr Leu Ala Val^Leu Leu Arg Gln^Leu 

iir^eu Tyr Lys His i^g^Cys Leu He Phe Thr Gin Met Ser Lya Met 

1205 1210 1215 

Leu ASP val Leu Gin Thr Phe Leu Ser His His Gly Tyr Gin Tyr Phe 

1220 1225 1230 

Arg Leu Asp Gly Thr Thr Gly Val Glu Gin Arg Gin Ala Met Met Glu 

1235 1240 1245 

Arg Phe Asn Ala Asp Pro Lys Val Phe Cys Phe He Leu Ser Thr Arg 
■ ••—•1250 ■ ^--'••'^ 1255--^ - - -: -1260--:. 

ser Gly Gly Val Gly Val Asn Leu Thr Gly Ala Asp Thr Val He Phe 
1265 1270 1275 12B0 

Tyr Asp ser Asp Trp Asn Pro Thr Met Asp^Ala Gin Ala Gin Asp^Arg 



1285 



CVS His Arg He Gly Gin Thr Arg Asn Val Ser He Tyr Arg Leu He 

' 1300 1305 1310 

ser Glu Arg Thr He Glu Glu Asn He Leu Arg Lys Ala Thr Gin Lys 
1315 1320 1325 
Phe Ph. W. Gl° "0 "5= 

: I r«« .y« cv. - - - 

Z u 1 r 0. ™. - - - - 

Arg JEyr Ala a ^^^^ 

, «lu Cys Lys Glu Ala Glu Ala Leu He Asp Gin Lys Arg^ 
Glu Glu cys ^y^^ 

T« ASP tys Asn Leu Asn Asp Thr Ala Val He P 

! r« I - - - - f'or ^ 

«p - - - v.ir . 

K« P.0 X. rP.0 «c .e. «c P.O .SP S« «P «.^"P 

: «p .p r.. - n. 

: p. s=«a .y= - 7 

Mn Asp "« " 1550 Thr Ma Glu Ser 

;i°\y. cya Gl„ Hat Pro Gl„ ..r G u «. n. Thr Al 
"° 16OS =,r ser Phe Ser Met Asp Aap 

Ala Al. ser pro Ala Tyr Thr Gl» His ser ser Ph ^^^^ 

Z ser Gl. r4°Al. .a n. Glu^rrser .e» Thr Gl» Gl« G „ 
Z Thr iJ^Thr Al. Thr Thr r Thr Thr val Prc^O. Gl„ Gl„ Gl„ 
,1. 01. Gin .ys ^t^Ser .y. .ye ^y.^"- ^ - 

g. Aa„ Ar, ^Ala Gl» Aah Gly «1 Ar= Ala Xhr^Thr 
Trlpro-Pro s„ T^^Ar, G. Gl„ Pre aU — .f^^" ^ 
I ne val ^l^Asi-Tyr Ala LrGl. Ala val Gin «1 Gl» Ph, 
»U val G. i;rser Ala Aa. 0. G. «« val ^ 
1 - val ser il^Al. val Aan .y. «h Thr Ars Phe^Phe 

S%rr Ala Ar, Gl„ S^ser lie Ar. Tyr .h «et Ph. val Ar, «o 
.l„ GIV r ... val Ala ser Aa» Pro 11. S.r .va .y. Thr 
1780 1785 1790

Met Lys Val Asp Leu Ser His Thr Glu Leu Ser His Leu Arg Lys Gly
1795 1800 1805

Arg Met Thr Thr Glu Ser Gln Tyr Ala His Asp Tyr Gly Ile Leu Thr
1810 1815 1820

Asp Lys Lys His Val Asn Arg Phe Lys Ser Val Arg Val Ala Ala Thr
1825 1830 1835 1840

Arg Arg Pro Val Gln Phe Trp Arg Gly Pro Lys Gly Arg Gly Gly Trp
1845 1850 1855

Leu His Asn Ser His Cys Asn Phe Phe Leu Thr Arg Asp Glu Lys Lys
1860 1865 1870

Trp Phe Leu Gly His Gly Arg Gly Ala Asp Lys Phe Gln
1875 1880 1885
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lin(n3628) genomic sequence (1 kb of upstream and downstream genomic sequence is 
included in this file). 



GTCAATGGAATTCTCGACGCGGATCTTGTTAGAGATGCCGTCGAGAGAGATT 

TGATCAAATTGCGGTACGCTGAAACGGATGCACCAGTTTTACAGGTAAAATG 

GAAATATACAAACTCAAAAGTAAAATTTTATGAATTTCAGATCAACAACTCA 

CTATACACGGCATCCTGGGAGCAAGATCTCGGAACAAATATGGTTCTGCAGT 

CAAAAGGAAAAGAGATGGAAGTGATTTCGTGTACATCGACCATGATGACTGC 

AGAAAAAGCCCTGTTGACCTCGTTAAGCACCGAAGGATCTACACTAGCCGCC 

AATGCAGAGACTGCTCCGAAATCTGATCTCAGTCGAACTCAACCACGTCAAC 

AATGATTTTCAAAATATAAATTAACATGAAGCTCTGAAATAAACTCATATAA 

CTGCTAAAATAAAACTGTTGCTTTTGAAACCAACATTTGTTAGACAACCTGCG 

TCTCACAGTCATTTTTCAATATATTGGCGCCGCGCACACACAAAGAAGAAGA 

ATTCGTCCTCATGGCATGGCATGTGCAGTCAGCGGCCACCCTGTGTAACCACT 

GCGTATCGCATCTTTCCACGTGTTTTTGCAATCTTGCTGTCACGTTCATTTCCT 

CGTACAACCATCTCTTCTACCCCCGTTGCCTCCTCCACCATCTCATCTCAATTG 

TGTCGTTGCCCTCCCTCTCCCCAAGTCTTTCTGCGTCTCTTAGTGCTCTTCGAG 

AAAAGAACGAGGAGAGCTGTGAGACGCTAGTAGGAAACGCATTCTCAATTC 

GATATAGGCACATTGAGAGAGAGCGAGCGCCGTTTCGACGTCTTCTAGCCTT 

CACATCATCCAGACGACGTTCACACGCACACACAGCCAACCCCACCCTTCTG 

ACAACGAATAGACGACGAAGAAGAGAAGAAGAAAAAGAAGAAGGTACCCA 

TTTTTCATTCCCTTTTTGCCTCCACACTTCACTATTATCGATTTTGTGAGCGAG 

CTCTAATGmCAACGCAAAGTGCTATTGCXTAAAAAGCGGTGAGAATTTGCT 

TCAGACAlGAAATtCGm 

GAAGGTCAATTTTTACTTTCAACGCTCTTCATTGACGGAAAACTCGTTTTTCTT 

TCAAATTTTAAATTACAGAGGCATTTTACTCAAGGTTTGTTTTAATTTAAATT 

AAAAATAAATTTTAAAATAGAAATATGGATAATATAAAATGTTTTCTTCAAA 

AAATGCACTCAGGTTCACCAAAAAATCGATAATTAAAAATACGGTCGCAAAG 

GAGCGTCGTTAGCTGCTAATCAATGGTCTTAAAACGAAATCTATCGATTTTTG 

TGTACTACACACGGACAAGTGCTCCACCGTTATTTTTTGAACGAGTGCGTTGC 



Exon number    Exon boundaries (inclusive)

1              1001 - 1035
2              1920 - 2062
3              2114 - 2190
4              2241 - 2501
5              2551 - 2903
6              2955 - 3405
7              3497 - 3631
8              4227 - 4690
9              5293 - 6058
10             6696 - 7058
11             7609 - 8338
12             8771 - 8933
13             9511 - 10306
14             10774 - 10851
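The exon boundaries above are given as inclusive positions in a genomic sequence that begins 1 kb upstream of the gene. A minimal sketch of how such boundaries could be used to assemble a spliced coding sequence is shown below. The function name `splice_exons` and the toy sequence are illustrative assumptions, not part of the original filing, and positions are assumed to be 1-based.

```python
from typing import Sequence, Tuple


def splice_exons(genomic: str, exons: Sequence[Tuple[int, int]]) -> str:
    """Concatenate exon segments given 1-based, inclusive (start, end) pairs."""
    # Remove line breaks and spaces so positions index nucleotides only.
    seq = "".join(genomic.split()).upper()
    pieces = []
    for start, end in exons:
        if not (1 <= start <= end <= len(seq)):
            raise ValueError(f"exon {start}-{end} lies outside the sequence")
        # A 1-based inclusive range [start, end] is the Python slice [start-1:end].
        pieces.append(seq[start - 1:end])
    return "".join(pieces)


# Toy example (hypothetical sequence, not taken from the figure):
# two 6-nt exons separated by a 4-nt intron, with 2 nt of flanking sequence.
toy_genomic = "cc ATGGCT gtag AAATAA tt"
toy_exons = [(3, 8), (13, 18)]
spliced = splice_exons(toy_genomic, toy_exons)
```

With a complete, error-free copy of the genomic sequence, the same call with the fourteen boundary pairs listed above would be expected to return the coding sequence; the OCR damage in this copy means the result would need to be checked by hand.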
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AATTCCATCCCATTTTGACGTTTTTCTTTTTTTTTTCATCAAATTTTTTAGCATT 
TAAAGTAAAGTCAATGATAACCTGCAAATAATAATGTAAAATTCATTAAAAA 
CCGAGAGAAAAAGTCTAAAGTCATAAATTTTTGATAAAAAAGTGATTTTCGA 
AACTAAAAATCATTCAAATTAAAGTTGAACCTGATTCTTCAATTTTTATTATA 
TATTAAAAGCTTGATCCACTCAAATAAAAGGAGTTTTTAATTGAGAAAAAAA 
GCAAATGAAAAAATCGATAATTAAATTGGGCGCCAACCTAGATTTTAATATG 
TTTTTGTTAGAAATTTGTATATTTTCATCACTCTCTGACTTTAAGCATTC GTAT 
TTTAAGGAAGTGTGAGCTTTCTAATATGTTTTTTATTAAAAAAAACATGTTTT 
TAACAATCTCCCTGTCATCCCCATCACCTAATGCACTCAAATAATCAATAATC 
ACAATACTTTTATTTTTTCTTGCAGAACAGAAATGGTCCAAACGAGACGAAA 
GACAGCTGCAGCTGTACAGGACGGTGGTGCCGTTAAGGAGAACAAAGCCAA 
GCCACCTGCCCCTCAAACGCCTACAAAACGAGCAAAACGAGGTCGTCCCCCG 
AAAATTAAGACTGGTGAGCGAATGACTATACGGAAGATTGAAAATTCACGTG 
GAATACTTGCAGATGCCAATACTTTGAATACGCCAAGCACITCTTCCAACTTG
GTCGATGACAAACTTCTCATTGAGTCTGAATCACAGGTAAATTGATTCTTTTC
TATTCAAAAATTAATCTAAACTATACATTCCAGGACTCGATTCTCACAAACGA
AGCCGACTCTTTTCTGGAAAAAGAAGTGGAAGAAATCGAAGATAGTTCAGAT
ATACTTCCCGATAAAATTAATTCTCCAGAAAAACCAAGTGTTTTGGTGAAGC
GGAGATCGAGTACGCGGTTAAAAGTGAAGACTGATGAAGATGAAAAAGATG
TTCCTGTGAACATAGAAGTAGCCGTTTTAGAAGAAAAATCAATTCAAATCGA
GCCAACATCTCCCGCTCACCCGGAAGATCCTCAGGTGAGCTTTTTTTAAAAAT
ATGTATTAATCAAAATTCCTTCATTTCCAGCCTTCGACTTCTTCTCTTCCACTG
GTAGAACCAATTGAAGACATTGTGGAGCCAAATGAGCCAACAAGCTCTGCCG
ATCCTCCAGTATCAAATATTAAGGATGAGGATATTAAAGAAGAAGAGCCACT
GATTAAAAAGCCAGCTTCCGATGAGTCAGAATCTATGGATATAGCTAACTCT
GAAAGTGGAAATGATTCCGATTCAAGTGAAGCTGATCCTAGGACGATACCAT
CTTTCTCTATACCTCTTCCCGACACACCACCTCCAAATTTTGCGAAAAGAGGA
GAAATACATGTAGATGTAGATCAGAAAAATTCCAAGCAATCAGGAGAATCAC
AATCGCCTTGGGAGCGGTAAGAATATTTATCCTAGCCAGGTGTTATAACAAA
ATTGAATAGTTTCAGAGCAAGAGAAAAGTCTGCATCGAACCCATTGTCCTCT
CCAACAATGAGCCGACCCAGGATACACTTCCTTCATCCAGCATATCAAAGTTT
CACAAATGATTCAGTTTCACCTCTACCACCACCGCCACCAGAGCCGGCTCCA
GCTCGTGAAAAAGTGGAAAATGGTGGTCCAACTACTTTCAAAATGACTTTCA 
AAAAAGCTGCAAATATTCCTATCTTGAAGACATCGGCATTTGAACAACCATC 
ATCACCTCCACCTTCCTCATCAGTTTCTTCATCAATTTCATTATCTGAAGTGAA 
TTCTTCTACATCGATAGCCTCCGAGTCTTCTCCAGCGAAAAGAAGCTCAAATT 
TCGATTTAACTGCCTCAAATGAGCTTCCACCACCTCAGATGGTTGAACTTCCC 
AAGCTCTCATTTTTCAATATGCCTCCAGCCGTTCGCTCCGCAGAGGTTAGTTA 
ACTTTTTCCCGGTTTCATGAAATTTCAGCGGTATCTGTCCTCCTTTTGGTGTGT 
GCCCTCACAACCTAACCTCTTTTATCCAGGACGATTCTGCGATGACGTCGGAA 
GAACCGATCCTTCTCCTCCGTTCTCCGAATTCCGCCACTCCTGATGATGATGC 
ACirnOCTCACGACCCCACCACCA^^ 
" GCACTGGT C3AGCCAG AtCACAC 

TTCAGACCGTTTTTCTTTACACCTCATCCCCTTTTGTGTTATGTTAACATTCAT 

TTTGTGTCTCAAACACTGCATGCTTTTGCACTTGGAAATTAAAAAATAATGCG 

TTCTGGGATTTTGTGTGTTAAGGTGGAGTAGAGTTTGTGAGGCTAGAAAGTAT 

GCCTTTTTCGTTTCTCCACTGCAAAATTTCGTTTGAAAAAAACAAAAAATTTA 

CTAAAATTTGAAATTTCACCAACTTGCCGTTGTCACAGCTGCTGAAATACAGT 
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TTTTATTGCATTTTCACCCTTTATTGCATATTATTATTAGACACCTTTTAGGTC 

AATAGGCAACCGAAAATATCCGAATTTGACTTAAAATGTACCTAAATTAAGG 
AACTAACTTGAGATATACGACTAAAAATGCAATAAATTGTGAGAATTATTGT 
TATGAAATTCAGCCGTTTTAGGCTAGTTTTAGCCAAAAACCGACAAACTCTAT 
TCCAATTAATTTTCCACTCCTGCACCTCGATTAGTGATTTTTTGAAGAAAAAA 
AATTATCTTCTTATTTCAGAAAGTAGCGACGGAAAAAGTGAATCAAGTAATT 
GCTCGACGTGAAGATTCTGAAAAAGATGTACGTCACAGAGAAGATCGAGATG 
ATTATGATAGACGACGTGACGACCGTGACAGAAGATCCAGAAAGACTGATTC 
GGAACGAAATGATCAAAGAGGACGACAACGTGAAGATGATGAACGAAGAGC 
TCGAGAACGAGAAAGAGAAGTTACGAAACGACATGATCGGGAAAGGGAAGA 
GATGCGATTACAGAAACAAAAAGATGAGGAAAGAAGAAAGAAAGATGAAG 
AGGAAAGGATACAAAAAGAGAATGATGAGAAAAAACAAAAAGAGGATGAA 
GCCAAAATGGAGGAGGAGAAAAAGAAGATTAAAGAGGAGGAAATGAAGAT
TCCTGAATTTGAGTTGATTAGCGAATCAAAATATTTGACGAGGAATGCGAAT
AAAAAGAAGACTGAATCCTTAACGTAAGTTATTATTTATAAATTTGACTTAAA
AATTGATAACTTTCAAAATTAAGTGATTCAATAGACTCAAAAGAATGAAAAA
CTAGAGTGCGCCTTTAAAGAGTACTGTAATTTCAAACTTTTGTTGCTGCTCAT
TTTTCATCGATTTTTCTTAGTTTTTCGTTAAAAATAATTCAACCATTGGATTAA
AAAAAATTAAAAACACATAAATTTTATTTTGAAAAGTAATGAGAAAAACTAT
AGAAATTCGCCGAAAATTCTACAGCAACAAAAGCTCAAAATTACAGTACTTT
TTAAAGGAGCACATCTTTCTGAATTTAACAAAAATTCGGAGATTTTTCTTTTT
TTCGTGTTTTTCTGGCGAAAAAACGATTTTTCGCTTTTACCGGAAACGGTATC
CGGAGGAAAAAAAAAACGAAAAAAGCGAAAAATTTTAAGAAGTTTCAAGAT
TAGTTACAAACTCTTTTCAAAAGCAGATTCTACAGTTTTTTGGGGTTTTGCCA
AAAAATTTATGAAATATAATGTTTTTTAGACTAGAAAAATAAACTAATTTTAA
TTTTCAATCAAAAGCTCATTATTATATTTATATTTATATAATTCAGTTGCGAAT
GCCATCGAACTGGTGGAAACTGTTCGGACAATACTTGTGTGAATCGTGCAAT
GCTCACCGAGTGCCCATCATCATGTCAGGTCAAATGCAAGAATCAACGATTT
GCAAAGAAAAAGTACGCGGCTGTTGAAGCATTCCACACTGGAACCGCCAAA
GGATGTGGACTTCGAGCAGTGAAAGACATAAAAAAAGGAAGATTCATCATTG
AATATATAGGAGAAGTTGTGGAAAGAGATGATTATGAGAAGAGAAAAACGA
AATATGCAGCTGATAAAAAGCACAAACATCATTATCTCTGTGATACTGGAGT 
CTACACGATCGACGCAACAGTCTACGGAAATCCATCTCGATTTGTGAATCAT 
AGTTGTGATCCTAATGCTATATGTGAGAAATGGTCTGTACCAAGAACTCCTGG 
AGACGTTAATCGAGTTGGTTTCTTCTCGAAACGATTCATTAAAGCCGGCGAA 
GAAATCACATTTGATTATCAATTTGTCAACTACGGACGTGACGCTCAACAATG 
TTTCTGTGGAAGTGCTTCATGTAGTGGATGGATTGGGCAGAAACCGGAAGAA 
TTTTCATCTGATGAGGATGATGATATTGTGACTACAAGGCATATTAATATGGA 
TGAAGAAGAAGAAGAAAAGTTGGAAGGTCTTGATCATCTTGGAAATCATGAA 
CGGAATGAAGTGATCAAGGATATGTTGGATGATTTGGTCATTCGGAATAAGA 
AGCATGCTAGGAAGGTTATCACAATTGCGGTAAGCATTTATTTGTAGAGAAA 

-ATTTAl^A^AWA^AGA-TGGAGTAeeGAm-TeeGAGAA-AT^^^^ 

TCCAATTTTTCCTCTGATTCCGAATTTTTAAATGAAAAAATTCAAAAAAATTT 
CCTTGATTTTATGTTTTAACTTGAAATTGCGAATTTCATTTGTACAGATTTTTG 
AAACGCCGAATTTTCGCGGC AG AGAAGCCATGTGTCG ATTTTTG AGATTTGTG 

TATATTTACAAGATTTTGAATCTTCATCGGATGCTGATTTGCGTTTTTCATCAT 

TATATTATCAAAAAACTAACAATTTGTTCGGTTTTACGGAAATTAACAATATA 
GACTAGACATTTCGTAAATATACACAAATCTCGTAAATCGACACATGGCGTC 
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TCTGGCGCGAAAATTCGGCATTTGAAAAATCTTATGCGGGCACTA ATGAA AT 
TCGTGATTTCAAGCTGAAATATAAAATCAGGGAATTTTCCTTGCATTTTTTCA 
CTCAGAACTTCGGAATCAGTTGCAAATTTGGAGTCATTTGAAAATA TTTCTCA 
GATTTCGGTACTCCACCTTTATTATAATTTTTAAAATTTTTTAAATGATTTTTT 
TTCCATGTTCAACAAAAAAATAAATTTTCAGTCTGCAATGACCGATTACTCTC 
AACGTGTGGATGTCATTCAAGAAATCTTCTCCTCAGACACCTCCGTAACCGTT 
CAAAAATTCTATGCAAAAGAGGGAATGGCTACATTGATGGCTGAATGGTTGT 
CTGAAGATGATTATTCGCTGGATAATCTGAAACTTGTTCAAGCTATTCTCAAA 
GCTCTTCACACTGAACTATTCGATTCGTGCGCCAAAAATGATCGACTCTTACG 
AGATTCTACATCACGATGGGTCAATGCGAAAATGGATGAATATGTTGATATA 
CAAGTGATAGCTGATTCACTTATTGCTTGTGTTGAAGATCCCGTACAGGAGTA
CAAGGATGTTTGCAAAGTTATAGAGGTATATACATATTAATTTTTAAAAAAG
AATATTTTTTGCATGTCACAAAATATTTGGAAATTTTCCCGAAAAACCCATGA
AATCAAAAAACAAATTAAATAGTAAAATTATTTCCTCCTACGAACATTTTTCG
ATTTTTCGXTTTCCGATATTCCTTTTAAAAATCTGATTTAAAAAAAAAAAACT
TAAATTTTAGGTCTTTTTGCTCCTTTTTAGAAGCAATTTATATGTTTTTTAAAA
CAAAACTTAAAATTAGCATTTTTATGGGTAATTTTCTGAACACATTTTTTTTTC
GAAAAAAATGGCCAGAATTTCAACCACTTCTCCGTAAAATCGAAATTAACTA
ATTTTTTCTCTATACATTTTTCAAAAAAAGACTCCTCATTTATTGTATTAGATA

CAAATATATGTTTTCCTCATCAAAATTTACGAAATTTGTTATAATTTTGAATTT
TTTTTGTTTTTTTTTCGAAAAATTGAAAATTTTCTAATTTTGAAACGATATTAT
ACAATTTCAGCGCCATCAATTTAACTAATTAAATAATTTCAGAAAGGTCTCGT
CGAAAACTTCACAAGAGCCAAAGAGATGGCCTATCGGTTAAATCAATACTGG
TTCAATCGATCAGTGAGCTTCAAAATTCCAAAAAAGATACGTGATCCTGTGC
CAAAAGATGTTCCAGTCAGACAAGAAGATGCTACAACATCATCACAATCTCA
TGATAATAGTAGTAGAACTGTATCACCGAATCATCGACATCATTCATCTTCAT
ATTCAAATTCATGTTATCAAGAACGAGAACCATCTCATATACGATTCTTTAAT
AATGGAAATGATGTTCATCAATATCGTTTTGGAGGTTATCATGGAAATAACTA
CAATGATAACTATTTCAGTAGAAGGCCCAATAAGGATTCATATCGAGATCGC
CGTCGATTTAATGGACGTCGTTCGAGAAGTCGATCAAGAAGTGTCTCACCAC 
AGAACTATAAAAGAAGAAAACTCGATGAACATGACAATAATCATCGTCAGC 
GTTCTCCAATTCGTGATCGTCACACATCTCCCGGCGGCGAAAAGACTCCTAGC 
TCGAATAATTCTGGAGAACGAAACTATAAAAGACTGGATATTCGAGGAGCTC 
GTATAAAAACTATAAAAGAAGATTTGGAAGCTGCTGCTGCTGCTGCTGCTGC 
TGCTGCTGTACCATCAGAAGTGCAAGCTTATCCTCATGAACATACAGCTGTAC 
ATCAGAGTGTTTATCAGATGCCAGGTTATGAGTCTTATGGTTGGTTTAGTTTT 
TTTAAAAATATCATTTACCAGGGTGCCATTTTTAAAAATAAAAATAACTCGGA 
AAATATGTTTTTAAAAAATTTCAGAATTTCTCTCATCAACATAAAACTTGATA 
-AAAATCGAATTTTTATTATTTTCTAAACATTTTTTCGGTTTTTCCGAAAATCAA 
AAAAAAAGTTTAGAAAATAGCAAAAAATCAGTTTAT TAGAA ATCAAATTTTG 
TTCGTTTTGATAAGAAAAAACATAAGAAAACATGTTATTTTCTTCTGAAAAAA 
GAAAAAAATCGAAAAATCTATGGCCTTTTGGCAAAATGTTTTGGACCAAAAA 

acaaTacaAatagcattaAaa™ " 

ttttctgaaagtcttgcttgtcgtatatcaaataaaaacatttttcaggagta 

tatgatcctgtaaatggtgtctacatgtatcctcatcctggcgctggttacta 

tccacctgcctatccacaacaaccgattatgttaacaatggacactcttccac 

cgaatgatcgtcttggtgaactttacgagaaagccagtatcgagcagctagc 

GTGAGCATTTTTTAGTTTAAACCTTTCGGATTTACCTAGAAAAATGTTACCTTT 
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GACGCAAAATTACGGTAGCAGGTCTCGTCGCGACCGAAATTTTTCAGCGGAG 
TACGGTAGCTTCCCATGAATTTTTTTGCTGAACTTATCTTTCTGATAACAAATA 
GTAACTAAAACATGAAAAACTGAATAAAAATTGATATCTTTACCTTATAGGC 
TCTTTAAGGGCGCAGACACAAAAACTGACCGGCTACCGTAATTTTTCGTCAA 
AAGTCACACATTTCrCAACTGGTGAAATCCGAAAAAATTGAAATTTTTACTAC 
. TCGTCCGACTGTTTAGAAAAGATTAAAAAAAAAGAAAAAAAGAATGTCGGTT 
TTTCGAATTTTCGATTTTCAAAGAAAAAAATGAATATTTAAAAATCATTTTCG 
GTAATTTCCCTAAATTTGTAAAATATAATTTCCAATAAATGTTTTTTGTTTTCC 
GGAATTTTAATAAAAAATCAATTTTCGCGTAACAAAAATGCGAAAAAATGAC 
TAGCCACTCGAATATAATAACACATGAAATAAAATTAAAATTATTACAGTCA 
ACGAGATGCAATTGTGAGACAAGAACTTGAGCTGATACGTATTCAAATCGAA 
AGAAAAACTGCTCAAAAAGAAGCGATCAAGGCCGCTTGCCGTCGTGCTAACG 
AAGAAGAAGCTAAACGACAAGAGGCACTTGCAAAGACGAAATATGTTTGGG
CGATTGCAAAGTCAGAAGGTGGAGAGACGTATTACTACAACAAAATAACAA
AAGAGACGCAGTGGACAGCACCAACACCAGTTCAAGGTCTTCTCGAACCGGC
TTGTGGTGCATCTCCTGATACTACAGTTGTCATTGCTGACGAGATTACTGAAG
AAGAGCAACAAGCTGAAGTTCTGGAGAAGCCGCGTGTTGTTAAGGAAGAAG
TTATCGAGCCAGGTTCACAATCTGAAACTCAAAAAGAATCTCCGGAGAAAGT
TCGAGTTGTTGTACCGAAAGTTGAAGTTGAAAGATCACCGTCGCCAAAATCT
TCTCGTGATCGTGAGAAGGATCGAGAGAAATCTCGTGAGAAAGATCGTGAAA
GAGATCGTGACAGAAGAGAAGGTTCAAAACATCGTGATAGTTATCATGGACA
TCGAAACGGCAGCAGTTCTGTCAGTGAACGACGTATGCGAGAGTTCAAACAT
GAGCTGGAACGATCCACTCGATCTGCCGTTCGTTCTCGTCTACAACATCAACG
TGACGCTTCTAGTGATAAGACTACTTGGCTTATTAAGTTAATATATCGAGAGA
TTTTCAAACGAGAAAGTGCGCAGAGTGGATTTGATTATCGATTCAGTGAGAA
TACTGATAAGAAGGTAATATTATGGACCAAAAAATAAACAATTGAAAAAAA
AACCAAAAAAATCTGATGCTTGAATTTAAAAAAAAACAATGAAAGAGTGCA

TCCAAAGTACCAAACTTCATTTTAAAAAATTTTATTTGACATAAAAATTGATA

ATTTAAAACTAATTTGAACATTTTTCCGCAAAAATTATAGATTTTTCTGCCAA

TTTTAGATTTTTAACGTTTTTTTTCGGACAATTAATGTTTCGAATCATCAATCA

GAATGAATATGATATCTGATGAAATTCAAAAATAATGCAATTTAAATAGAAA 

ACGGTACAAAAGTTTTGAAAAATTTAGAAGAATTCTAAAAAAAATCCTGTCC 

TTCAGGACAAAATTCAACCTTTTTCTCAAAACACAAAAATTACTTTATATTAT 

TTTTCAGGTGAAAAACTACGTCAAGTCATATATCGACCGAAAACTCGAATCA 

AACGATCTCTGGAAAGAATACTCTCGGCCATGAGCTTTATTTTTTAATTTAAA 

TTTTATAAAAAAATGTTTATGCTTGTTTTTTTCTCTATAGTTCCCTCCTATCCC 

CCCCCTCCCCTATCGCCTAAAAATTGATCTCTGTCTGATTTCACCGATTTCCGT 

TTTATTTGATCCCATTGAACGAGTATATCATCATGTTCCTGAACTTCAACGTTC 

GCACATTTTATTCCCCTAGTTTTATGTCCCCAGAATTGTTTTATACTATCCTGT 

AATCCACCTCAAAATGACAGCCATGAAAAGCTGTTTTTCATGTTTTCTATTTT 

-GTTGTTGA-TG-GTA-TTTGGGGGGGTG-TT-TGTGGGeAAATO 

AAAATGAATTACGGATGTTGAATTTTTAAATTTATTTTTTTAAAGAAAAATTG 

TGGAA GTTTTTC AGATTCTATACTGCTTATTTTTACGCTAAATTTTTTTTCGAA 

GTCCCCmTTTCAAATCGAAGTGTAACTGCGCTCCACGATCAATAGAGACTC 

TCCGCC CTCG AACCATGGGTCTCGTTAGGTATTTGGCAGACTTACCGTAAATT 

CAAATGTTTTATTACTTCGCGACTAATTTTTTTATTCATGACTCAATTTTTTAT 

CAATTCCAACGAAAAACTAATTAAAAACAACGGAAAACATAACGAAAAATG 
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CTTGAAAATTGCAGACATTTCCGAAATTAATTAAATTCCTAACGAGACCCATG 

GCTCGGGGGCGGAGTGTTTTCGATTAGCCATGGAGCGCGTTGAGATATTCCT 

AAATTTTTCTATTCAGATGTCGAATCAATCAAAACGGGTCACAGTGAGAATT 

GAAATTATTCGTCAATTTTCAGAGTTTTAAAATTCCAACATCAAGAGCAAGA 

AGATCGGAAGCTCAAATATGTTCTGCACAAAGCTCACGAGAATCTGAGAAAG 

TGCCCATTCGAGATTCTGACAATTG 



Figure 24 LIN(n3628) Protein 

MFQRKVVLPKKRTEMVQTRRKTAAAVQDGGAVKENKAKPPAPQTPTKRAKRG

RPPKIKTDANTLNTPSTSSNLVDDKLLIESESQDSILTNEADSFLEKEVEEIEDSSDI

LPDKINSPEKPSVLVKRRSSTRLKVKTDEDEKDVPVNIEVAVLEEKSIQIEPTSPAH 

PEDPQPSTSSLPLVEPIEDIVEPNEPTSSADPPVSNIKDEDIKEEEPLIKKPASDESES 

MDIANSESGNDSDSSEADPRTIPSFSIPLPDTPPPNFAKRGEIHVDVDQKNSKQSGE

SQSPWERAREKSASNPLSSPTMSRPRIHFLHPAYQSFTNDSVSPLPPPPPEPAPARE

KVENGGPTTFKMTFKKAANIPILKTSAFEQPSSPPPSSSVSSSISLSEVNSSTSIASES 

SPAKRSSNFDLTASNELPPPQMVELPKLSFFNMPPAVRSAEDDSAMTSEEPILLLR 

SPNSATPDDDALFLTTPPPPKMTESEIQALKVATEKVNQVIARREDSEKDVRHRE 

DRDDYDRRRDDRDRRSRKTDSERNDQRGRQREDDERRAREREREVTKRHDRER 

EEMRLQKQKDEERRKKDEEERIQKENDEKKQKEDEAKMEEEKKKIKEEEMKIPE

FELISESKYLTRNANKKKTESLTCECHRTGGNCSDNTCVNRAMLTECPSSCQVKC 

KNQRFAKKKYAAVEAFHTGTAKGCGLRAVKDIKKGRFIIEYIGEVVERDDYEKR

KTKYAADKKHKHHYLCDTGVYTIDATVYGNPSRFVNHSCDPNAICEKWSVPRT 

PGDVNRVGFFSKRFIKAGEEITFDYQFVNYGRDAQQCFCGSASCSGWIGQKPEEF

SSDEDDDIVTTRHINMDEEEEEKLEGLDHLGNHERNEVIKDMLDDLVIRNKKHA 

RKVITIASAMTDYSQRVDVIQEIFSSDTSVTVQKFYAKEGMATLMAEWLSEDDY 

SLDNLKLVQAILKALHTELFDSCAKNDRLLRDSTSRWVNAKMDEYVDIQVIADS 

LIACVEDPVQEYKDVCKVIEKGLVENFTRAKEMAYRLNQYWFNRSVSFKIPKKI 

RDPVPKDVPVRQEDATTSSQSHDNSSRTVSPNHRHHSSSYSNSCYQEREPSHIRFF 

NNGNDVHQYRFGGYHGNNYNDNYFSRRPNKDSYRDRRRFNGRRSRSRSRSVSP 

QNYKRRKLDEHDNNHRQRSPIRDRHTSPGGEKTPSSNNSGERNYKRLDIRGARIK 

TKEDLEAAAAAAAAAAVPSEVQAYPHEHTAVHQSVYQMPGYESYGVYDPVNG 

VYMYPHPGAGYYPPAYPQQPIMLTMDTLPPNDRLGELYEKASIEQLAQRDAIVR

QELELIRIQIERKTAQKEAIKAACRRANEEEAKRQEALAKTKYVWAIAKSEAGET 

YYYNKITKETQWTAPTPVQGLLEPACGASPDTTVVIADEITEEEQQAEVLEKPRV

VKEEVIEPGSQSETQKESPEKVRVVVPKVEVERSPSPKSSRDREKDREKSREKDR

ERDRDRREGSKHRDSYHGHRNGSSSVSERRMREFKHELERSTRSAVRSRLQHQR 

DASSDKTTWLIKLIYREIPKRESAQSGFDYRFSENTDKKVKNYVKSYIDRKLESN 

DLWKEYSRP 
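The protein sequence above can be related to the genomic sequence by translating the spliced exons with the standard genetic code. The sketch below is an illustration under that assumption; the names `CODON_TABLE` and `translate` are hypothetical helpers, and codons containing characters other than A, C, G, or T (as left behind by scanning errors) are reported as `X`. The example fragment is the run of coding sequence that begins at the `ATGGTCCAAACG...` methionine codon in the genomic listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragment copied from the genomic listing (second exon, from the ATG onward).
fragment = "ATGGTCCAAACGAGACGAAAGACAGCTGCAGCTGTACAGGAC"
peptide = translate(fragment)
```

The resulting peptide, `MVQTRRKTAAAVQD`, matches the corresponding stretch of the first line of the protein figure, which is one way to cross-check a scanned protein line against the nucleotide listing.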
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Figure 25 



lin(n4256) genomic sequence (1 kb of upstream and downstream genomic sequence is 
included in this file). 



GCTTGCATCGAAACTCTTCTCATTATTTACGTGATGATCACATCTTTCGTTGGG 

CTGTACTCCCTTCCGGTTCTTCGTTCTCTTCGACCTGTTCGAAAAGATACTCCA 

ATGCCAACGATAATTATTAATTCTTCAATAGT TCTTG TTGTTGCATCCGCTCTt 

CCAGTAGCTGTTAACACAGTTGGAATGACAACTTTTGATCTTCTCGGCTCCCA 

CTCATCGCTCCAATGGCTTGGATCATTTCGAGTCGTTGTTGCCTATAATACTCT 

ATTCGTCGTGTTGTCTGTCGCATTTCTCTTCAATCAATTGACTGCTTCAATGAG 

AAGGCAAATCTGGAAGTGGTAAGCTGTGCAATTTAAAGTTTAAATTCTTATTA 

ATTTTTTTGCAGGATATGTCAACTACGATGTGGAATCAGACGGGAGAGTGAT 

GCGGATGAAACCATTGAGATCCTTAGAGGCGATAAGAAAAGCAATTGAATTT 

CTTTCCTTTTTCAACACTTCTTACCCATGTTCATCATTTTAATCTTTTCATTACA 

AAAACAAGGTCCTATTTTTTTTCTCGGGTACTACTCGCCTTTTCTAATAATTCA 

GAATCATCAATTTTTGCCAACCTCTAGCTTTACATGTCTGTTTTTCATCATT^ 

CTCTCAAGCATTCTCCTAATATATTATGTTCCCTAGTATTTCCCCTCAGTCAGC 

AATTTTCTCGTCGTCGAAACCGTTTAGCTTTACTTTCAATCAAAACGTGGAAC 

ATTTTTCAAACTATTTGAAGCCAAAAAAAACCAGGGCTTTTGTATATGTACCA 

TATTTTCCCTCTGATTTTCTTTATCGCCTTCTCTTTTCATGTAGAATAACTGAA 



AAATTATTTCTTCAGGTTTGAAAAAATCCGAAATATGTATCTGCAACTCTTCA 

GGGCATTGCCTCAATTAATTTTTATCTAATATTCAGATGGACCAACAAGAACC 

ATCGAATAACGTAGATACGAGCAGTATTCTTTCGGATGATGGGATGGAAACA 

CAGGAACAAAGTTCATTCGTCACTGCTGTGAGTGAAATTATTTAAAATTTCGC 

TTCGGAGATTCATTGTCATATAATTCAATTTATCGATTTTCAGACAATTGACC 

TAACAGTGGACGACTACGATGAAACAGAAATACAGGAGATTCTGGATAATG 

GAAAAGCAGAAGAAGGAACAGATGAAGATTCTGATTTAGTTGAAGGGATTCT 

TAACGCTAATTCAGATGTCCAAGCGCTCCTTGATGCGCCATCTGAGCAAGTA 

gctcaagctcttaattcgttcttcggaaatgagagtgaacaagaagctgttg 
cagcacaaagacgggttgatgcggagaagactgccaaagatgaagctgaac 
-tgaag€aagaggaagagggggwaga-t3fge-aa3^aaaggaaagaa^faataaa 
attattttattttcaggaagatcttattatagaagattcgatagtcaaaactg 
atgaagaaaaacaagcagttcgaagactgaaaatcaacgaatttttatcgtg 
gttcacaaggctccttccagaacaatttaaaaatttcgaattcacaaatccga 
actatctgacagaatctXtcagcgattcaccggttgtaaatgtcgataaatgc 
aaggaaattgtcaaatcgttcaaggaaagtgaatcacttgagggactttcac 

AGAAATACGAATTAATTGATGAAGACGTGCTAGTCGCTGCTATTTGTATTGGC 



Exon number    Exon boundaries (inclusive)

1              1001 - 1096
2              1166 - 1453
3              1501 - 2199
4              2298 - 2730
5              3234 - 3847
6              4148 - 5778
7              6111 - 6333
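As a consistency check on the boundaries above, the exon lengths can be summed to see whether the spliced coding region is a whole number of codons. The sketch below assumes the seven boundary ranges correspond, in order, to exons 1 through 7; the names `N4256_EXONS` and `exon_lengths` are illustrative.

```python
# Boundary pairs as listed above, 1-based and inclusive.
N4256_EXONS = [
    (1001, 1096), (1166, 1453), (1501, 2199), (2298, 2730),
    (3234, 3847), (4148, 5778), (6111, 6333),
]


def exon_lengths(exons):
    """Length of each exon; inclusive boundaries need the +1."""
    return [end - start + 1 for start, end in exons]


n4256_lengths = exon_lengths(N4256_EXONS)
n4256_total = sum(n4256_lengths)          # total spliced length in nucleotides
n4256_in_frame = n4256_total % 3 == 0     # True if a whole number of codons
n4256_codons = n4256_total // 3           # includes the stop codon, if present
```

Under these assumptions the total is 3984 nucleotides, which divides evenly into 1328 codons.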




TTTAATTATCAATACTGTCCGTATAGGTAA 
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GTTCTCGATACCAACAACGAAGAAGATGTCGACTTTAATGTTCTATGTGATGA 

TCGTATCGACGATTGGAGTATAGAAAAATGTGTCACTTTTCTTGATTATCCAA 

ATACTGGATTGAATTCGAAAAATGGACCGTTGAGATTCATGCAGTTTACTGTC 

ACATCACCTGCATCAGCAATTCTCATGCTCACTCTGATTCGATTACGCGAAGA 

AGGGCATCCGTGTCGATTAGATTTTGATTCAAATCCGACTGATGATTTACTCT 

TGAATTTCGATCAAGTGGAATTTTCTAATAATATCATTGATACGGCAGTCAAA 

TACTGGGATGATCAGAAGGAAAACGGTGCGCAGGATAAAATTGGCAGGCGA 

GTATTAATCAAACTCACAACTGTTTTGAAAGTATTTTCATAATTATCACTTAA 

ATACCTTTTAGAGAGCTCAACGACTTCTTCCACGAAATCGAGTCAACATCAGC 

AGAATTCAAACAACATTTTGAGAACGCCGTTGGCAGCCGTAATGAAATAATT 

CAACTTGTCAACGAGAAAATTCCCGATTTTGATGGCACTGAGGCTGCTGTGA 

ATGAGAGTTTTACATCCGATCAACGAACCGAAATTATCAACTCTCGTGCAAT 

AATGGAGACATTAAAAGCCGAGATGAAGCTCGCCATCGCCGAAGCTCAGAA 

AGTTTACGACACCAAGACTGACTTCGAAAAATTCTTCGTTTTGACAGTTGGAG 

ATTTCTGTCTGGCTCGCGCCAATCCTTCTGACGATGCAGAATTAACATACGCC 

ATAGTTCAGGATCGTGTGGATGCAATGACCTATAAGGTTAAATTTATCGACA 

CAAGTCAGATCAGAGAGTGTAACATCAGAGATTTAGCCATGACTACGCAGGG 

AATGTATGACCCGAGTTTGAATACATTTGGTGATGTTGGTGAGTTTTAAGTTA 

AAATTGATATTTAATATTACATCTGTTATGTAGAATAAGGGTTTCGGTTTTTC 

GATTTTATTAGAAAATCGAAAATTTTAGTTTTTGTGTTAAATTTAAAAAAATC 

AAAATTTGATTCACTATCAAGTCCGTTTTTCTCTTCTCAAAATTGACAAAATTT 

TGATAATCTAGAATTTTCGTCCCGTATATTTTTCAACGAAAAACCATTTAAAA 

TTTTCCATGATTGGATTTTCGGTTGATCTAGAAAAAAATGGTGCTAAACACTA 

AATTTGAAAAAGTTTGAAACAAATTCAAATCCAAATATTTCATGAAAAACTT 

GTAAAATATATTATGTACACAAAAAAACGTTTCAAGTGTAGCAGTTGTTTTTT 

GTGGTCCCAAAAAAGCAGATGTTTGTCAGAATCCATTAAACAACAAAAAAAT 

CCAAAAACTCAACCTGGCCTAGATATCAGTTTCATGATCGAAGTATCTAAAA 

TCATTGTTTTCAGGTCTTCGAGTTGCCTGTCGCCAAGTTATTTCCTCGAGCCAA 

TTTGGAAAAAAAACAATTTGGCTTACCGGTACAGCTGCCGGACGTCGCAGAG 

CTCATAGATCCGATTTTCTAATTTTCTTCGACAACGGAACCGATGCATACGTG 

TCAGCTCCGACAATGCCTGGTGAACCAGGTTATGAAGTTGCTTCTGAAAAGA 

AAAGTGTATTTTCTCTCAAAGAAATGATTGCGAAGATGAATGCTGCTCAGATT 

GCTATTATGGTTGGACAGCCAGTAGGAAAGGAAGGAAATCTGGATTATTTTT 

TGACATTTCATTGGATTCGACAATCTCACAGATCAGCGTATATTCGGGATTTT 

ATGAAAGAATTTCCGGAATGGCCACTTCTCAAGATGCCAGTTGGAATGCGAA 

TCTGTTTGTACAATTCTCTTGTTGATCGACGTAAGAAAATGGTGACAGTGATT 

GGAACTGATCGAGCTTTTGCTATTGTGAGACACGAAGCACCGAATCCATTGG 

CTCCTGGGAATAGATGTACAGACTTTCCGTGCAATGATAGAAATCATCAGCA 

TATTGACGAGAAAATCTATAGAGGATCTCATAGATTGGAAGGCGCAGCGGTA 

AGATTTTATTTGAAAAATTGATACAAAACGAGGATTTTCTAAAATTATTTTAT 

TTTTATTTGATTTGATTTCTTATAATTGATAATCAAGGTTTTTTGGATGTTTTG 

rrAGAGAAATCGAAAAGGGAAACTTjCCAAAAAAAAGCTGTGAAAXCAATTT^^ 

TGCTTTTAATAATATCCAAGTTTCATCTTCAAAGTTTTTTCTATAAAATGGACA 

CAAACTTTTCAACGTTTTCAAAAAAAAGGTTCCGAAAATATGAAAAAAGGAG 

AAAGAAATCATGAAAATTTTGTATTATTTCAGCACAAGAAGCACATGATCTC 

GACAAATAACAATCTGTCGCAACGCAGAAAAGACCAGCTTCAATCACAGTTC 

GAGCCAACCGACATGATTCGTTCGATGCCAGAGAGGAATCACCAACAAGTCG 

TTAAAAAGAAAACGACGGGCACCAATCAGAATGTCGCTTCGACAAATGATGC 
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AAAATCGAAGAGAGAAATTGAAATAAGAAAGAAAAATCAATTCTTATTTAAC 
AAGATTATTGTTCCAATACCCGTCCTAACACCATTGGAAAATCTCAAGGCTCA 
TGCTCAATGTGGTCCAGATTGTCTACAGAAAATGGATGCGGATCCGTATGAA 
GCAAGATTCCATCGAAATTCACCAATACATACTCCTCTTTTGTGTGGTTGGAG 
ACGAATTATGTACACAATGAGTACTGGAAAGAAGCGGGGAGCAGTGAAGAA 
AAACATTATTTACTTTTCTCCATGCGGAGCCGCTCTTCACCAGATCAGCGACG 
TCTCTGAATATATTCATGTCACCAGAAGTTTATTGACGATTGATTGTTTTTCAT 
TTGATGCACGAATCGATACTGCCACTTATATTACTGTTGACGATAAATATTTG 
AAGGTTGCTGATTTTTCGCTTGGAACCGAAGGAATCCCAATTCCACTAGTGAA 
CAGCGTGGATAACGATGAGCCTCCATCATTGGAATATTCGAAACGACGATTC 
CAATACAATGATCAAGTGGATATATCGAGTGTTAGCCGAGATTTCTGTTCTGG 
ATGCTCTTGTGATGGTGATTGCAGTGACGCATCGAAGTGTGAATGCCAACAA 
TTGTCCATTGAAGCAATGAAACGACTCCCCCATAATTTA CAATT CGACGGAC 
ACGACGAATTGTATGAGAGTTCAGAAAAACAAAATAAATTTTTAAAACTATT 
XrTrrTCAGAGTXCCTCACTATCAAAATCG-TCXTCXCAGCAGTAAGGTTATCA 
GTGGACTCTATGAATGCAACGATCAGTGTTCATGCCATCGAAAGTCTTGTTAC 
AACAGAGTTGTTCAGAACAATATCAAGTATCCTATGCATGTGAGTTTATTTAA 
CGATGATACATACCAATTATTGTTTTTTCTTCAGATCrrCAAAACTGCTCAATC 
CGGATGGGGAGTCCGAGCTTTGACGGATATTCCTCAAAGTACGTTCATTTGCA 
CGTATGTAGGTGCTATACTGACGGATGATTTGGCTGATGAACTAAGAAATGC 
GGATCAATACTTCGCTGATTTGGACTTGAAGGATACCGTGGAGCTGGAAAAG 
GGTCGCGAAGATCATGAAACTGATTTTGGTTACGGAGGAGACGAGTCAGATT 
ATGATGACGAAGAAGGAAGTGATGGTGACTCCGGTGATGATGTAATGAACA 
AAATGGTGAAACGTCAAGACTCTTCGGAGAGTGGTGAAGAAACAAAACGGC 
TGACAAGACAGAAAAGAAAGCAATCTAAAAAATCCGGTAAAGGAGGAAGTG 
TGGAGAAAGATGACACCACTCCAAGAGATTCAATGGAAAAGGATAATATTG 
AAAGTAAAGACGAACCCGTTTTCAATTGGGATAAGTATTTTGAGCCGTTTCCA 
TTGTATGTTATAGATGCAAAACAGAGAGGAAATCTTGGAAGGTAAGATCACA 
ATTTTATTCATTAAAAAAATTTTTTAGAGATTTTGCTTTAAATGATAAAAAAT 
GGACAAACCAACCGTTTGCCTCTTCTTTTGGTTTATCAACCTTTCTCTATGGAA 
AAAATTCTGAAAAATTAACAAACAGTATTTCACGTTGAAAAGTGAAGAAAAA 
AGCAAAAAAAGGAAACAAATTTCAAAACGGTTCTACTCCATCTTAAAAAAAC 
TAAAATTCGTAAAAAGTCATTTGGTATGTTTTGGAGACTATAATACAATTGAG 
AAAATTTGAAAAACCGGCACTCCAAAGATACAATCATAAATTTTCGATAACT 
TTCAGATTCTTGAATCACTCTTGCGATCCGAATGTGCACGTTCAACACGTCAT 
GTACGATACGCATGATCTTCGTCTTCCATGGGTCGCGTTTTTCACACGAAAAT 
ACGTGAAAGCCGGCGATGAGCTAACCTGGGACTATCAATATACTCAAGATCA 
GACGGCTACCACACAACTCACATGCCACTGCGGAGCTGAAAACTGCACCGGC 
CGTTTGCTGAAAAGTTAAAGAATTGTTGTTATTTCCTTCCCAGTTATGTTTTCC 
TTTTTiTTTAAGTATTTATTTATTTATTTAATTTTTATTTTGm 
GTTTAAAATCTCCCTTTGAAAACAGCATCTCATAT GTATG ATCTAAA CACG TA 
_TTTACCTCGTAACiGGTTTG^^ 
CTGCGAAfAAAATGTTTTAAAAAAGACATTAfAt 

TTTTGATGTCTCCAATCTATTTCAGTTTACAATTTTAAAATATAGAATATATAT 
ATTTAGGTTTCATAAGTTATGCATCGATTACGGGTTCTACGTCACTTGAAGTT 
. CTGCATTTCCACGTCACATAGGACTACTGTAGTTTTAAAAAATACTCGTTCAT 
TTTGTAATAATATTCCTTCTACTAGTTTTGCTTCTGGTAATAATCGAATTTCAA 
AACTTTAGCTAAAATATTTCTTTTTGAAGAGGCTGCAGCAAAATATGAAAAG 
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AAAAGTCCAACTGAACATGTATTACTTCGACCCGATACATATATTGGAGGTG 

TCGCCATGCGAGAAGATCAAATTATTTGGCTCAGAGACTCAGAAAATAGAAA 

AATGATTGCAAAAGAAGTCACTTATCCACCTGGATTATTGAAGATTTTCGATG 

AGATTCTAGTGAATGCGGCTGATAATAAAGCAAGAGATTCCAGTATGAATCG 

GTTGGAAGTATGGTTAGATAGGTAAATATATTGCAGGAATTTATGTTCTGCGA 

CAAAGCTACGATACGCTGTCTCGCCACGACAATTGTTTTGGTAAATGCATGA 

AAATCGACGTGCACCTTTAAATAATACTGTAGTTTTAAATTCTCGTTTCTTCA 

ATTTTTCATAAATGGTTTTCCGATGAATATATGATTTTAAAAAAATCTAAAAT 

TCACATTAATTTATAAGAAACAAAATTCCTCAAAAACGAAAGTTTGGCGATA 

CAGTACTATC 
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Figure 26 

LIN(n4256) amino acid sequence 

MDQQEPSNNVDTSSILSDDGMETQEQSSFVTATIDLTVDDYDETEIQEILDNGKA 

EEGTDEDSDLVEGILNANSDVQALLDAPSEQVAQALNSFFGNESEQEAVAAQRR 

VDAEKTAKDEAELKQQEEAEDLIIEDSIVKTDEEKQAVRRLKINEFLSWFTRLLPE 

QFKNFEFTNPNYLTESISDSPVVNVDKCKEIVKSFKESESLEGLSQKYELIDEDVL 

VAAICIGVLDTNNEEDVDFNVLCDDRIDDWSIEKCVTFLDYPNTGLNSKNGPLRF 

MQFTVTSPASAILMLTLIRLREEGHPCRLDFDSNPTDDLLLNFDQVEFSNNIIDTA

VKYWDDQKENGAQDKIGRRVLIKLTTVLKNAVGSRNEIIQLVNEKIPDFDGTEA

AVNESFTSDQRTEIINSRAIMETLKAEMKLAIAEAQKVYDTKTDFEKFFVLTVGD 

FCLARANPSDDAELTYAIVQDRVDAMTYKVKFIDTSQIRECNIRDLAMTTQGMY

DPSLNTFGDVGLRVACRQVISSSQFGKKTIWLTGTAAGRRRAHRSDFLIFFDNGT 

DAYVSAPTMPGEPGYEVASEKKSVFSLKEMIAKMNAAQIAIMVGQPVGKEGNL 

DYFLTFHWIRQSHRSAYIRDFMKEFPEWPLLKMPVGMRICLYNSLVDRRKKMVT

VIGTDRAFAIVRHEAPNPLAPGNRCTDFPCNDKNHQHIDEKIYRGSHRLEGAAHK 

KHMISTNNNLSQRRKDQLQSQFEPTDMIRSMPERNHQQVVKKKTTGTNQNVAS

TNDAKSKREIEIRKKNQFLFNKIIVPIPVLTPLENLKAHAQCGPDCLQKMDADPYE

ARFHRNSPIHTPLLCGWRRIMYTMSTGKKRGAVKKNIIYFSPCGAALHQISDVSE

YIHVTRSLLTIDCFSFDARIDTATYITVDDKYLKVADFSLGTEGIPIPLVNSVDNDE 

PPSLEYSKRRFQYNDQVDISSVSRDFCSGCSCDGDCSDASKCECQQLSIEAMKRL

PHNLQFDGHDELYESSEKQNKFLKLFFFRVPHYQNRLLSSKVISGLYECNDQCSC 

HRKSCYNRVVQNNIKYPMHVSLFNDDTYQLLFFLQIFKTAQSGWGVRALTDIPQ

STFICTYVGAILTDDLADELRNADQYFADLDLKDTVELEKGREDHETDFGYGGD 

ESDYDDEEGSDGDSGDDVMNKMVKRQDSSESGEETKRLTRQKRKQSKKSGKG 

GSVEKDDTTPRDSMEKDNIESKDEPVFNWDKYFEPFPLYVIDAKQRGNLGRFLN 

HSCDPNVHVQHVMYDTHDLRLPWVAFFTRKYVKAGDELTWDYQYTQDQTATT

QLTCHCGAENCTGRLLKS 
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Figure 27 

lin-65 genomic sequence (1 kb of upstream and downstream genomic sequence is
included in this file)



Exon number    Exon boundaries (inclusive)

1              1001 - 1133
2              4522 - 5208
3              6128 - 6361
4              7962 - 8350
5              8706 - 8928
6              9260 - 9516
7              10328 - 10567
8              11677 - 11700
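Because the exon boundaries are inclusive, the intervening intron spans follow directly from consecutive exon pairs. The sketch below derives them from the eight lin-65 exons listed above; `LIN65_EXONS` and `intron_spans` are illustrative names, and positions are assumed to be 1-based.

```python
# Boundary pairs as listed above, 1-based and inclusive.
LIN65_EXONS = [
    (1001, 1133), (4522, 5208), (6128, 6361), (7962, 8350),
    (8706, 8928), (9260, 9516), (10328, 10567), (11677, 11700),
]


def intron_spans(exons):
    """Return 1-based inclusive (start, end) spans between consecutive exons."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


lin65_introns = intron_spans(LIN65_EXONS)
lin65_intron_lengths = [end - start + 1 for start, end in lin65_introns]
```

By this calculation the first intron (positions 1134 to 4521) is the longest, at 3388 nucleotides, which would be consistent with the long run of near-identical repeats that follows the first exon in the listing.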



AAAAATTTAAAAAAATTTTTAAAAATTCGTGTAA AAATT ACCCCGGTTGTTTA 

GGAAATAATAAAGAGATTAGAGACTTTTTTCAGATTTTTATTTTCT TGAG TT^ 

TGCCGGTTTTCAGCCGATTTCTATCTTTTTTTTCTCATTTTTO 

CGCTAGTTTTCCCCTCAATTTCTCGATTTTTTCACGATTTTTTGAAAATTTTCG 

GAAAATTGAATTGTTTGCAAAAAAAAAAATTCAAAAACCGCATTTTTCTCAG 

AATTTTTCTGGGATTTTGTACAAATTTTTGAATTATTTCTCAAAAAAAAGCAG 

GTTTTTACCGATTTTTTTGGTTTTTTCCCCAAAATTTTCCGATTTTTTCCGAGTT 

TTTSGAAAATTATTTGAAAAAAAATCAAAAAACCGCATTTm 
TTTCTGGGATTTTGTACGAAATTTTGAAATTTTTCTCGAAAAAAGCAAGTTAT 
TCCCCAAAATTTTCTGATTTTCCCCCAAAAATTTAGATTTTTCCCGAGTTTTCC 
CCAGTTCTCAGCTGATTTCTATATTTTTTTCTCAATTTTTGTGATTTTTTGTTGC 
TAGTTTTCCCTTCAATTCCTCGAGTTTTTCACGATTTTTTGGAGATTTTCGAAA 
AATTGTTTGAAAAAAATCAAGAAACCACATTTTTCTCTGGATTTTCTCGAAAT 
TTGCACAAAATTTTTGAATTTTTTCGTAAAAAAAAACTGTTTTCCCC AAAAA T 
TTCAGATTTGTTTTTGATTTTTTTCGAGATTTTCCCCTGATTTCAAAGTTTTTTC 
CTGAATTTTTCGAATATTTCCTGAAAAATCGGCTATTTCTAACTTTTTAAATAA 
TTTTTTTTGAATTTCTGACTTTTTAAATCCTTTTTTTTTTGCCAlll 1 
TAAAATTCTAAATTATTCAAAATTTTACAGAATGTCAGAAGTAATCGACGAA 
AGTATCTTAAATACAGAAGCTTCAGATGATCCAATACCTCCATTAAATGATG 
ATCAGATTGCTGAGCTTTTGGGTGAAGATGGAGAAATTATGGAGATAACTGA 
GCAGAAAGGTGAGATTTTTTGAGTAAAACCTTGAATTTTGCACTAAAAATTTG 
CAATTTTCGCTAAAAATTACCTTAAAACTCGAAAATTGGAATTTCTAGCTGAG 
AAAATGGCCAAAAATGTCGAAAAATGCCTCCGAAACCTGTGAAAAAAAAAA 
CCACCAAAAAGGTTTCTAGGCCACCAAAAAGATTTCTAGGCCACCAAAAATG 
jTTrTArrGrrArCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACC 
AAAAATGfTTCTAGGCCACCAAA^ 

ATGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCCCCAAAA 

AATTTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGC 

CACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGT 

TTCTAGGCCACCAACCAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCA 

AAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTA 
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GGCCACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAA 

TGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGTTTCTAGGCC 

ACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAAGGTT 

TCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCA 

AACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAACAGGTTTCAA 

TGCCACCAAAAATGTTtCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAA 

TGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCA 

CCAAACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAACAGGTTT 

CAATGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCCCCAA 

AAAATTTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAG 

GCCACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAG 

GTTTCTAGGCCACCAACCAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCAC

CAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTC

TAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAAGGTTTCAAGGCCACCAAA

AAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAACAGGTTTCAATG

CCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGACCACCAAAAAGG

TTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGTTTCTAGGCCAC

CAAACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTC

TAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAA

AAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAATGTTTCTAGG

CCACCAAACAGGTTTCAATGCCCCCAAAAAATTTTTCTAGGCCACCAAAAAG

GTTTCTAGGCCATCAAAAATGTTTCTAGACCACCAAAAAGGTTTCTAGGCCAC

CAAAAATGTTTCTAGACCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTC

TAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAA

AAGGmCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGTTTCTAGG

CCACCAACCAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAAAAGG

nrCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACC

AAAAAGGTTTCTAGGCCACCAAAAAGGTTTCAAGGCCACCAAAAAGGTTTCA

ATGCCACCAAAAATGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAA

AGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGTTTCTAGAC

CACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGT

TTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACC 

AAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCT 

AGGCCACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAA 

ATGTTTCTAGGCCACCAAACAGGTTTCAATGCCCCCAAAAAATTTTTCTAGGC 

CACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGACCACCAAAAAGGT 

TTCTAGGCCACCAAAAATGTTTCTAGACCACCAAAAAGGTTTCTAGGCCACC 

AAAAATGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCA 

ATGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCCCCAAAA 

AATTTTTCTAGGCCACCAAAAAGGTTTCAATGCCACCAAAAATGTTTCTAGGC 

CACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAATGT 

TrCTAGGCCACCAAAAAGGlXrCTAGGCCACCAAACAGGTrrCAATGCCACC 

AAAAATGTTTCTAGGCCACCAAACAGGTTTCAATGCCACCAAAAAGGTTTCT 

AGGCCACCAAAAATGTTTCTAGACCACCAAAAAGGTTTCTAGGCCACCAAAC 

AGGTTTCAATGCCACCAAAAAGGTTTCTAGGCCACCAAACAGGTTTCAATGC 

CACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGT 

TTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACC 

AAACAGGTTTCAATGCCACCAAAAATGTTTCTAGGCCACCAAACAGGTTTCA 
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ATGCCACCAAAAATGTTTCTAGGCCACCAAAAATGTTTCTAGGCCCCCAAAA 

AATTTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGAC 

CACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGACCACCAAAAAGGT 

TTCTAGGCCACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACC 

AAAAATGCTTCTAGGCCACCAAAAATGTTTCTACGCCACCAAAAGCCG CCTC 

AAGCCCGAAAAATTTGAATTTCCCGCTCAAAAAATCTAAAATTTTCCGATTTT 

CAGACGAATCAGATGATGTGGTGATGCTGGACGACGATGATGACGACACTCC 

GGAACCGATTCTCGTGATTGATATGGATGAGGATGAGGATGTTACTACAGAT 

GGTCCTGAATCTCAGGAAGAGCTGGCTGCAGATGCTCCGGCTCCAGGAGCTC 

CAGAAGCTTCAGCTCCAGCTCAAGAAGCCTCAGAAGCTTCAGCTCCGGATCA 

AGAAGCTCCAGAAGTTCAGGATGTTCCGGATTCTTCGGGAGCTCCAGATGCT 

TCAGCTCAGGCTTCAGAGGCTTCTGATGCTTCAGCTCCAGAAGTTCCAGGATC 

TACAGAAGCTCAGGATGCTCAGGATGTTCCGGATTCTTTGGGAGCTTCAGAT

GCTTCAGCTCAAGAAATTGCAGAAGCTCCAGAAGCCCCAGAAGCTCCAGAAA

TCGCCGCTGAAATCGACGAAGAAGTGCTGCTCGCCGAGCAAAATGGAGTTTT

GGACGAAGGATTTGATGAGACTGACGATATTATCATAGAAGAAGAAGCTGTA

GAAGAAGCTGAAGCCGTGGAGCCACCAATTAACACTGAAAATCAGGAAAAC

GCGCTGGAAATGCTCGAAGAGCGCCTCAAGAAGAATGAAGAAAAGGAAATT

GTGGAGAAAAGTGATGTGAAGCCAGAGGATGAAGATATTATACATATGGAG

ACGGATTCAGTTGAAAGTATGGGCTTTTTTAGCTGGAAAACAGGAAAAAAGA

GCAAAAAATTGATACATTTCCAGCTTAACCAATCTTTTTTTGAGTTGTAAAGC

CTGAAAATTGAGATTTTTGTACCAACTTTTATGATAAAGCTGAAAAAAAAATT

AATTTTTTGACGAATTTTTAGCGGAAACCCTGAAAACATGTTTTGTCTGAAAA

ATACAGAAAATCGTCACTTmACAATAAATTCGAGATTTTTAGCTCAAAAAT

ACAACATTATAGTGCAAAAATCTCAGAAAAAGCCAAAAATTTCATTCAAACA

TCTCAAAAAAAGCAGAAATTTTACTCAAAATATCTCAGAAAAAGCTAAAATT

TTCCCAAAAAATCCCAGAAAAAGCAGAATTTTCATTCAAAATTCCCAGAAAA

AGCTGATAATTTACTAAACAATCTCAGAAAATGCTGAAATTTTACTCAAAAG

TCTTCATAAAAAGCTGAAATTTTACTTTAAAAGTTTAGGAAATGCTGCAATTT

CACTTAAAAATCCCAAAAAAGCTAAAATTTTCCCAAAAAATCCCAGAAAAAG

CAGAAATTTTACTCGAATATCTCAAAAAAAAAAAAGCTGAAATTTCACTCAA

AAATCCCAGAAAAAGCTAAAAATTTACTAAAAAATCTCAAAAAAAAAAACG

CTAAAATTTCACTCAAAAATCTCAGAAAAAGCTAAAATTTTACTCGAATATCT 

CAAAAAAAAAAACTGAAATTTTCCTAAAAAATTTATGAAAAACCGAAATTTC 

ACTTAAAAGTCTCATAAAAAGCCGAATTTTCCCAAAAAAATCCCAGAAAAAG 

CTAAAAATTTACTTTAAAATCTCATCTGTAATTTTAGTTTAAAATCTCAGAAA 

AACCCGAAATTTCTCTCAAAAATTTGCTGATTTTCAAATTTTCAGCGTCAAGC 

CGCAAACGTACTGGCGGAGCCACAAGTCCGCGGAGCCCGGCTCAAAAACGA 

CCAAAACGACGTGTTCAAACGTTATTAAAGATGCGTCAGAATGCAATTGAAC 

TATTGACACGACTTTATGGCTCATGGGATGCACAATTGAGCCTCTCAAATCTT

GAGACAATTCGATTGTTGGGTGTCAATAATAATAGGAAGCTTATCGAAATTTT 

TGAGGAGAATGAGCAAGGTTAAAGCGTTTXrAAATGCTAXGAAAACTGACAA 

ATTTTCGATAAAAAAACGGATTTTTGGAAGAAAATCGCCTGAAAATTCATGT 

TTTTCTGCAAATTTTGACCAAATTCCCAAGAAAAATACGATTTTTTAGTCCGA 

AAATGCTCCAAAAAGATTTCTAGGCCACCAAAAAGGTTTCTAGGGCACCAAG 

AAAGIXrCTAGGCCACCAAAGTATTTATAGGCCACCTAAGATGTTTCTAGGCC 

ACCTGAGATGTTTCTAGGTCACCAAAAATGTTTCTCGGTCACCAAAAATGTTT 

CAAGGCCACCGAAAAGGTTTCTAGGCCACCTAAGTATTTCTAGGCCACCTAA 
GATGTTTCTAGGCCACCTGAGATGTTTCTAGGTCACCAAAAATGTTTCTAGGT
TACCAAAAATGTTTCAAGGCCATCGAAAAGGTTTCTAGGCCACCAAAGTATT 
TCTAGGCCACCTAAGATGTTTCTAGGCCACCTGAGATGTTTCTAGGTCACCAA 
AAATGTTTCAAGGCCACCGAAAAGGTTTCTAGGCCACCAAAAAGGTTTCTAG 
GCCACCAAAAATATTTCTAGGCCACCTAAGATGTTTCTAGGCCACCTGAGAT 
GTTTCTAGGCCACCTGAGATGTTTCTAGGCCACCTGAGATGTTTCTAGGTCAC 
CAAAAATGTTTCTCGGTCACCAAAAATGTTTCAAGGCCACCGAAAAGGTTTC 
TAGGCCACCTAAGTATTTCTAGGCCACCTAAGATGTTTCTAGGCCACCTGAGA 
TGTTTCTAGGTCACCAAAAATGTTTCTAGGTTACCAAAAATGTTTCAAGGCCA 
TCGAAAAGGTTTCTAGGCCACCAAAGTATTTCTAGGCCACCTAAGATGTTTCT 
AGGCCACCTGAGATGTTTCTAGGTCACCAAAAATGTTTCAAGGCCACCGAAA 
AGGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATATTTCTAGGC 
CACCAAAAATGTTTCTAGGTCACCAAAAATGTTTCTAGGTCACCAAAAATGT
ATCAAGGCCACGAAAAAGGTTTCTAGGTCACCAAAAATGTTTCTAGGCCACC
AAAAATGTTTCTAGGTCACCAAAAATGTTTCTAGGCCACCAAAAAGGTTTCT
AGGCCACCAAAAAGGTTTCTAGGCCACCAAAAAGGTTTCTAGGCCACCAAAA
AGGTTTCAAGGCCACCAAAAAGGTTTCTAGGCCACCAAAAATGTTTCTAGGT
CACCAAAAATGTTTCTAGGCCACCAAAGTATTTCTAGGCCACCTAAAAGGTTT
CTAGGCCATCAAAAAGGTTTCTAGGCCATCAAAAAGGATTCTAGGCCACCAA
AAATATTTCTAGGCCACCTAAGATGTTTCTAGGCCACCAGAGTATTTCTAGGC
CACCTAAGAGGTTTCTGGGCCATCAAAAAGGTTTCAAGTCCATCAAAAAGGT
TTCTAGGCCACCAAAAAGGTTTCTAGGCCACCGAAAAGGTTTCTAGGCCACC
AAAAAGGTTTCTAGACCACCTAAGACATTTCTAGGCCAACAAAAAGGTTTCT
AGGCCACCAAGAAGCCGAAAAACTGTCTCAAATTCGAATTTTGCAGTGCTCA
AACAAAAAGTGTCCGCACTGACAGAAGAGCTGAAAAAGGAGAAGCTGGCTC
ACGCGGGAACCCGTTCAGCATTGAAAGAATTGACTAATGAAATAACTGGAAT
GCGTGTACAAATGAATAAACTACGTTCAATGGTCACTCAGCCTACGACTTCG
AAAATTATTGATAGTTTTGTTCAACGTCATCAGGCTTTCGAGCAGCAACAACA
ATTCCAACACCAACACCACCAACACCGACCAATAATGTTGGCTCCACGTCAT
CATCCGCCGCCGCCCCCGCATTTTACACCGAATCAACGGGCGGCGGCTCCGT
ATCATCCGAATATGGTTCAACCGAATCGTCTTGCTGCTATGCCACATAGAAGA
CCGATTATTGGAATGCAGGTGAAAATGGAATGCCATGAAAATTTCGGGCCGG 
AAAATTTTGGAAAATCCTCTAAATTTTCAATATTTGTCGAAAAAATCTGACAA 
AAATCGTGTCAAAATTCAGATTTCCGGGAGAAAAATCGCATTTTTGAGTAAA 

AATTTTAGTCAAGAAAATTAAGAAAAATGCGAAAATTTCGAGCAAAAAATAT

AGTTTTTTGGAGCCGAAATTGTGAAAAATGCGATTTTTTTCGAAAAATCTGGA 

CAAAAAATTTCAAACAAGAAAAACCACTTTTTTAAAAAAATTTTCACACAAT 

TTCCAGCAACAAAATTCGGCTCCACCACAATTCAACGGTCACCAAGCTCTCGT

CCCATCACCTCAATCATCATCTGCATTTTCTCGTCCACCACCAACTCAACTTG 

CAACACAGAGAAGAGCTCCACCATTGGCAAGTACCGGCCTTCCGGCAACAGT 

CAGATGGGAAGCAATTCCACCGCCAAAAAATCCGAATGTCGGGCACAATGA

GCCACCGCTTAACAATGGAGGTTCGTCGTGTGCAACAAAAAGAGCACCGCTT 

TTCCACGACGAGTTTTTGCGATGATGATTTTGGTGTGAAAATTGAAAAACTCA

TTTTTTTAAAGTCTGAAATTTGAAAATTTGAGAAAAGTTTTTTAAAAAAAGTT 

TTATGAGGGATTTTCTGACAATTTTTTATAAACGGAAAATTACGAAAACTCCA 

AAATTTGTGTTCTTTCGGAAAACGAATTTGAAATTTGAACCAAAATTTTGACA 

ATTTTCTGGGGATTTTTGACTGGAAATTCGTTTTTCATCGATTTTTCCTCCTTT 
AATTTTCGGTAAAACCCCTGTCTCCAATTCCAGGCCGTGCACAGCCACTAATC
GATAATACACGTGTACACGACAATACAATTATGCTGTGTGTACCACTTGTCTC 
CACTGCAAATACAATATCATCGGGCGATTCGACACGTCTACCAAAAGTACCA 
CGAATCTACGAGAATCTCACGGCAAATCCCGATTTGAGTGTGACGATTCATTC 
GAGTGCACAGGATTTCCGAGAGAATTATCAAATTGGTGGAAAGATTAACTAT 
GAATATCTCGGAGGATTTGATCAATATGTAGGTGATGATGTTTTTTTATTGAG 
AGATAAATACGAAATTCCATTACAATCGATATTTTTTGACTGAAAAATGTCTG 
AAAAATCAAAAATTTTAGCTAAAAATTGAGAATATTTTTGTTTAAAAAAAAT
CATTGAAATTGATTTTTTTTTATTCCATAAAAATCTCGGAAAAGTCAATTTTC 
AGTCATAAATCTTCTGAAAATTATCCAAACAATGGGATTTTCTGAAATTTTAG 
CTTAAAAATTGAGGATTTCCCGGTTTTTTCAGAGAAATTCCATTACAATCGAT 
TTTTTTACTGAAAAATCCTCTGGAAATTAACAAAAACCAAATAAAATGCCCT
AATTTTTTTTTAAATCCAAAAATTGTTGGATTTTTTCAGAAAAAAATATTTTTT 
CAATTGACTGGTGTGCAAAAAATATAGAAAATTCAAATTTTCCAAGAAAAAT 
TAGCCAAAAAAATGTAATTTTTGTCTAACAAAAAAATTGAATAGCGCAAAATT
AAATTGTCGTTTTTTTTAATTTCCCTCCGGTTTTGAAAGGAAAAAATTCCATA
AAAATCGAAATTTTTTGACTGAAAAATCCATGAAAACTCGAATTTTGAGTCA 
AAAATCCTCTGAAAATGCTCCAAAATATGAGATTTTCTGAAATTTCATCAAAA 
ATTAAGAATTTCACGGTTTAAAAAAAATTCCATTAAAATCGATATTTTTCAAG 
TGAAAAATCTCTGGAAAACTCGATGTTTGAGTCAAAATTCGTCTGAAAATGC 
TCCTTTAAATTGAAAAATTGAAAAAAAAACCGCCCACAATATTTGCAGAATA 
TCCAAGTGTTCGTCCAAGTGTCATCTCTTAAATTCACTGGAATGAACGGTTAC 
CCGGATCCAGAAGATCGTATATCAATTGACTGGGGATGCTCGAAATTGTGGC 
CTTGTAAGCCGAAATCTCATCACAAATTCCGTGTACGCTTCCATCAAGCACAA 
CTGCTGCCGAAGAACGATCGAATTACGATTGTGGCTGTGGCGAAGGATAAAA 
CTAGCGGAATTATTCACATTTCGCAGGTGAAAAATTGGAAAATTTGCACAAA
TCCAGACAAAAAAAACTGAAAAATCGAAAAAATTTTTGTAATTTTTTGCCGA 
AAACGAAAATTAAAAACTGATAAAAATTGATTTTTAACCGGAAAATCCCTGA 
AAAATCAAACATTTTTTGCTAAAAATTGAGAATTATACGGTTTTGGGTAAAA 
AAAAACTATTTAAAAAAAATATTTTTTCTTTAAAAATCTCAACAAAAAAAAA 
ACCAATTTTCATTCAGAAATCCCCCCGGAGAATTGTCAAAATTTTGGGAATAC 
TCTGAAATTTCGATAAACACCTCATTTTTGATTAAAATTGATTTTTTAACTGA 
AAAATCCCTTAAAAAACGAATATTTTAGTTTTTTCACAAAAAAATGTGCAATT 
TATCTGAAATTTCAGCAAAAAAAATGAAAAAAAAAAATTCCGAAATTAAAA 
ACTGATAAAAATCGATTTTTTACTTGAAAAATTCGTGAAAAATCAAACACATT 
TTTGCTAACCATTGAGAATATTACGATTTTGTGAAAAAAAAAACCATTAAAA
TTGATTTTTTATTCCTAAAAAATGCCAGAAAAATCAATTTTCAGTCAAAAATC 
ACCGGAAAATTATCAAAATTTTGAGGTTTTCTGTGAAATTTCAAGCTGAAATT 
TCCATTTTTGAATAAAAAAAATGTGGCTGGATTTAAAAAAAAACCATTAAAA 
TTGATTTTTTAACTGAAAAATCCGTATTTCTCTGAAATTTCAGGCAAAAAATG 
TCATTTCCGAAATTAAAAATTGCGACAAAATCAAATAAAATTGATCAAATTT
GCAAAAAA AAAAA AACITT CGCAMAA ATCCTTAAAATTTACAinTlCAA^ 
AAAAACTCGAATTT^^^ 

GATTTTTTGAAATTTTAGCTAAAAATTGAGAATTGCACGGTATTTAGAGAGGG 

AAAAATTCCATAAAAATCGATATTTTCCTCTTTAAAATCTCGAAAAAAATCAT

CAATTTTCATTCAAAAATCCCCCCCGGAAAATTGTCAAAATTTTGAGATTTTT 

CTGAAATTTCACGCAAAAATTTTCATTTTTTCAGCCCACCTTCATCACTCTCGA

ATGATCGATCTCTTCACGTCAAATGCACTTTTTTCTGGATTTTTTTGTTAAAAA 
ATTTGAAATTCTCGTGTTTTTTCTTCTGAAAAATTGCTTTTTTTGATTTTTTCTG
TAATTTTTTTTTTGTTGATTTTCTTAATTTTTTTAAT^ 

ATCTCTTTCTCTCTCTCTCTGAATCTCAATTTTTTCCTGAATTTCCCCGTTTTTT 

TCTGATAATTTTCAATATTTCTCTGAATTTTTCTATTCCCCCCGTTGTAATGCC 

AAAATATGTGGTAATTTCTCCCCATTTTTTCGCTTTATTACTATTTATTCTATT 

CAATTGGTGCCTCTCTCAATGTGTTGTATGAAAAACACTGTTTTATGGAGGTT 

TTGGAGAATTTTGAATTTTTTCGTCGTGATTTTTATTGGTTTTCTTTACCAATT 

CAATTTTTTTTTTAATTCGAAAATTTGTAGAAATTCACTTTTGTAGCTTAAAAA 

ATTAAAAATTGAGAAAATTTGTTCAAAAATGGCAAAGTTTTCGAAATTTTAGT 

CTAAAAAAAGATTTTTTTAATATAGAATTTTAAAAAATTAGCACAGAAAAAT 

GCCGAAAAATTCGTAATTTTTCATTTAAAAATGAAAAAAAAAAAAAACAAAA 

AAAAAAAAAAAAAAGAGGGAAAAATCCCATTAAAAGTAGTTTTTTGACTGC 

AAAATCGTCTGGAAATTAACAAAATTTAAAAAAATCTTTTTTACAGCCCATCG 

TTTCCAAAAACCAAATAAAATGCCAAAAAAAAATTTTTATGCAAAAATTCTG 

GATTTTTTTCCGATTTTTTCAAAAAATTCCCCCTTCTAAAAAAAATGGTGAAT 

TTGTTCCCAAAAACCCAAAATTTGAGATTTTCTAAAATTTTGGCAAAAATTAA 

GAATTTCACGGTTTTGAGAGGGAAAAACTCCATTAAAATTGATGATTTTATGA 

CTAAAAATTCCTAAAAAATCAATTTTCAGTCAAAAATTAAATTT 
1 aaggaattag actctttatc taaagtgaag aatgatcaat taagaagttt ttgtcccata
61 gaattaaata taaatggatc tcctggggca gaatctgatt tggcaacatt ttgcacttct 
121 aaaactgatg ctgttttaat gacttctgat gatagtgtga ctggatcgga attatcccct 
181 ttggtcaaag catgcatgct ttcatcaaat ggatttcaga atattagtag gtgcaaagaa 
241 aaagacttgg atgatacctg catgctgcat aagaagtcag aaagcccatt tagagaaaca 
301 gaacctctgg tgtcaccaca ccaagataaa ctcatgtcta tgccagttat gactgtggat 
361 tattccaaaa cagtagttaa agaaccagtt gatacgaggg tttcttgctg caaaaccaaa 
421 gattcagaca tatactgtac tttgaacgat agcaaccctt ctttgtgtaa ctctgaagct 
481 gaaaatattg agccttcagt tatgaagatt tcttcaaata gctttatgaa tgtgcatttg 
541 gaatcaaaac cagttatatg tgatagtaga aatttgacag atcactcaaa atttgcatgt 
601 gaagaatata agcagagcat cggtagcact agttcagctt ctgttaatca ttttgatgat 
661 ttatatcaac ctattgggag ttcaggtatt gcttcatctc ttcagagtct tccaccagga 
721 ataaaggtgg acagtctaac tctcttgaaa tgcggagaga acacatctcc agttctggat 
781 gcagtgctaa agagtaaaaa aagttcagag tttttaaagc atgcagggaa agaaacaata
841 gtagaagtag gtagtgacct tcctgattca ggaaagggat ttgcttccag ggagaacagg 
901 cgtaataatg ggttatctgg gaaatgtttg caagaggctc aagaagaagg gaattccata 
961 ttgcctgaaa gaagaggaag accagaaatc tctttagatg aaagaggaga aggaggacat 
1021 gtgcatactt ctgatgactc agaagttgta ttttcttctt gtgatttgaa tttaaccatg 
1081 gaagacagtg atggtgtaac ttatgcatta aagtgtgaca gtagtggtca tgccccagaa 
1141 attgtgtcta cagttcatga agattattct ggctcttctg aaagttcaaa tgatgaaagt 
1201 gattcagaag atacagattc ggatgatagc agtattccaa gaaaccgtct ccagtctgtt 
1261 gtggttgtgc caaagaattc tactttgccc atggaagaaa caagtccttg ttcttctcgg 
1321 agcagtcaaa gttatagaca ctattctgac cattgggaag atgagagatt ggagtcaagg 
1381 agacatttgt atgaggaaaa atttgaaagt atagcaagta aagcctgtcc tcaaactgat 
1441 aagtttttcc ttcataaagg aacagagaag aatccggaaa tttcttttac acagtccagt 
1501 agaaaacaaa tagataaccg cctgcctgaa ctttctcatc ctcagagtga tggggttgat 
1561 agtacaagtc atacagatgt gaaatctgac cctctgggtc acccaaattc agaggaaacc 
1621 gtgaaagcca aaataccttc taggcagcaa gaagagctgc caatttattc ttctgatttt 
1681 gaagatgtcc caaataagtc ttggcaacag accactttcc aaaacaggcc agatagtaga 
1741 ctgggaaaaa cagaattgag tttttcttcc tcttgtgaga taccacatgt ggatggcttg 
1801 cactcatcag aagagctcag aaacttaggt tgggacttct ctcaagaaaa gccttctacc 
1861 acgtatcagc aacctgacag tagctatgga gcttgtggtg gacacaagta tcagcaaaat 
1921 gcagaacagt atggtgggac acgtgattac tggcaaggca atggttactg ggatccaaga 
1981 tcaggtagac ctcctggaac tggggttgtg tatgatcgaa ctcaaggaca agtaccagat 
2041 tccctaacag atgatcgtga agaagaggag aattgggatc aacaggatgg atcccatttt 
2101 tcagaccagt ccgataaatt tcttctatcc cttcagaaag acaaggggtc agtgcaagca 
2161 cctgaaataa gcagcaattc cattaaggac actttagctg tgaatgaaaa gaaagatttt 
2221 tcaaaaaact tagaaaaaaa tgatatcaaa gatagagggc ctcttaaaaa aaggaggcag 
2281 gaaatagaga gtgattctga aagtgatggt gagcttcagg acagaaagaa agttagagtg 
2341 gaggtagagc agggagagac atcagtgccc ccaggttcag cactggttgg gccctcctgt 
2401 gtcatggatg acttcaggga cccacagcga tggaaggaat gtgccaagca agggaaaatg 
2461 ccatgttact ttgatcttat tgaagaaaat gtttatttaa cagaaagaaa gaagaataaa 
2521 tctcatcgag atattaagcg aatgcagtgt gagtgtacac ctctttctaa agatgaaaga 
2581 gctcaaggtg aaatagcatg tggggaagat tgtcttaatc gtcttctcat gattgaatgt 
2641 tcttctcggt gtccaaatgg ggattattgt tccaatagac ggtttcagag aaaacagcat 
2701 gcagatgtgg aagtcatact cacagaaaag aaaggctggg gcttgagagc tgccaaagac 
2761 cttccttcga acacctttgt cctagaatat tgtggagagg tactcgatca taaagagttt 
2821 aaagctfcg^g tgaaggagtsi"'^1^cacg^^^ att^tTattTtT' ca^^^ 

2881 aagaatgatg agataataga tgccactcaa aaaggaaatt gctctcgttt catgaatcac 
2941 agctgtgaac caaattgtga aacccaaaaa tggactgtga acggacaact gagggttggg 
3001 ttttttacca ccaaactggt tccttcaggc tcagagttaa cgtttgacta tcagttccag 
3061 agatatggaa aagaagccca gaaatgtttc tgcggatcag ccaattgccg gggttacctg 
3121 ggaggagaaa acagagtcag catcagagca gcaggaggga aaatgaagaa ggaacgatct 
3181 cgtaagaagg attcagtgga tggagagcta gaagctctga tggaaaatgg tgagggtctc
3241 tctgataaaa accaggtgct cagcttatcc cggctaatgg ttagaattga aactttggag 
3301 cagaaactta cctgtctgga actcatacag aacacacact cacagtcctg cctgaagtcc
3361 tttctggaac gtcatgggct gtctttgttg tggatctgga tggcagagct aggtgacggc 
3421 cgggaaagta accagaagct tcaggaagag attataaaga ctttggaaca cttgcccatt 
3481 cctactaaaa atatgttgga ggaaagcaaa gtacttccaa ttattcaacg ctggtctcag 
3541 actaagactg ctgtccctcc gttgagtgaa ggagatgggt attctagtga gaatacatcg 
3601 cgtgctcata caccactcaa cacacctgat ccttccacca agctgagcac agaagctgac 
3661 acagacactc ccaagaaact aatgtttcgc agactgaaaa ttataagtga aaatagcatg 
3721 gacagtgcaa tctctgatgc aaccagtgag ctagaaggca aggatggcaa agaggatctt 
3781 gatcaattag aaaatgtccc tgtagaggaa gaggaagaat tgcagtcaca acagctactc 
3841 ccacaacagc tgcctgaatg caaagttgat agtgaaacca acatagaagc tagtaagcta 
3901 cctacatctg aaccagaagc tgacgctgaa atagagctca aagagagcaa cggcacaaaa 
3961 ctagaagaac ctattaatga agaaacacca tcccaagatg aagaggaggg tgtgtctgat 
4021 gtggagagtg aaaggagcca agaacagcca gataaaacag tggatataag tgatttggcc 
4081 accaaactcc tggacagttg gaaagaccta aaggaggtat atcgaattcc aaagaaaagt 
4141 caaactgaaa aggaaaacac aacaactgaa cgaggaaggg atgctgttgg cttcagagat 
4201 caaacacctg ccccgaagac tcctaatagg tcaagagaga gagacccaga caagcaaact
4261 caaaataaag agaaaaggaa acgaagaagc tccctctcac caccctcttc tgcctatgag 
4321 cggggaacaa aaaggccaga tgacagatat gatacaccaa cttctaaaaa gaaagtacga 
4381 attaaagacc gcaataaact ttctacagag gaacgccgga agttgtttga gcaagaggtg
4441 gctcaacggg aggctcagaa acaacagcaa cagatgcaga acctgggaat gacatcacca 
4501 ctgccctatg actctcttgg ttataatgcc ccgcatcatc cctttgctgg ttacccacca 
4561 ggttatccca tgcaggccta tgtggatccc agcaacccta atgctggaaa ggtgctcctg 
4621 cccacaccca gcatggaccc agtgtgttct cctgctcctt atgatcatgc tcagcccttg 
4681 gtgggacatt ctacagaacc cctttctgcc cctccaccag taccagtggt gccacatgtg 
4741 gcagctcctg tggaagtttc cagttcccag tatgtggccc agagtgatgg tgtagtacac 
4801 caagactcca gcgttgctgt cttgccagtg ccggcccccg gcccagttca gggacagaat 
4861 tatagtgttt gggattcaaa ccaacagtct gtcagtgtac agcagcagta ctctcctgca 
4921 cagtctcaag caaccatata ttatcaagga cagacatgtc caacagtcta tggtgtgaca 
4981 tcaccttatt cacagacaac tccaccaatt gtacagagtt atgcccagcc aagtcttcag 
5041 tatatccagg ggcaacagat tttcacagct catccacaag gagtggtggt acagccagcc 
5101 gcagcagtga ctacaatagt tgcaccaggg cagcctcagc ccttgcagcc atctgaaatg 
5161 gttgtgacaa ataatctctt ggatctgccg cccccctctc ctcccaaacc aaaaaccatt 
5221 gtcttacctc ccaactggaa gacagctcga gatccagaag ggaagattta ttactaccat 
5281 gtgatcacaa ggcagactca gtgggatcct cctacttggg aaagcccagg agatgatgcc 
5341 agccttgagc atgaagctga gatggacctg ggaactccaa catatgatga aaaccccatg 
5401 aaggcctcga aaaagcccaa gacagcagaa gcagacacct ccagtgaact agcaaagaaa 
5461 agcaaagaag tattcagaaa agagatgtcc cagttcatcg tccagtgcct gaacccttac 
5521 cggaaacctg actgcaaagt gggaagaatt accacaactg aagactttaa acatctggct 
5581 cgcaagctga ctcacggtgt tatgaataag gagctgaagt actgtaagaa tcctgaggac 
5641 ctggagtgca atgagaatgt gaaacacaaa accaaggagt acattaagaa gtacatgcag 
5701 aagtttgggg ctgtttacaa acccaaagag gacactgaat tagagtgact gttgggccag 
5761 ggtgggagga tgggtggtca ggtaagacag actctaggga gaggaaatcc tgtgggcctt 
5821 tctgtcccac ccctgtcagc actgtgctac tgatgataca tcaccctggg gaattcaacc 
5881 ctgcagatgt caactgaagg ccacaaaaat gaactccatc tacaagtgat tacctagttg 
5941 tgagctgttg gcatgtggtt agaagccatc agaggtgcaa gggcttagaa aagaccctgg 
6001 ccagacctga ctccactctt aaacctgggt cttctccttg gcggtgctgt cagcgcacag 
6061 acccatgcgc atccccaccc acaacccttt accctgatga tctgtattat attttaatgt 
6121 atatgtgaat atattgaaaa taatttgttt tttcctggtt tttgtttggt tttcgttttg 
6181 cttttagcct ctacatgcta ggatcacagg aagactttgt aaggacagtt taagttctcc 
— ^62'4l'"'t~§'ea3'g^§t^^^ 

6301 cagccttgtg ctatgttgat aagattgatt tactgcttaa aatcacttta ctttatccaa 
6361 tttttactga actttttatg taaaaaaata aaatcaatta aag 
KELDSLSKVKNDQLRSFCPIELNINGSPGAESDLATFCTSKTDAVLMTSDDSVTGSELSPLVKACMLSSNG

FQNISRCKEKDLDDTCMLHKKSESPFRETEPLVSPHQDKLMSMPVMTVDYSKTVVKEPVDTRVSCCKTKDS

DIYCTLNDSNPSLCNSEAENIEPSVMKISSNSFMNVHLESKPVICDSRNLTDHSKFACEEYKQSIGSTSSA 

SVNHFDDLYQPIGSSGIASSLQSLPPGIKVDSLTLLKCGENTSPVLDAVLKSKKSSEFLKHAGKETIVEVG 

SDLPDSGKGFASRENRRNNGLSGKCLQEAQEEGNSILPERRGRPEISLDERGEGGHVHTSDDSEVVFSSCD 

LNLTMEDSDGVTYALKCDSSGHAPEIVSTVHEDYSGSSESSNDESDSEDTDSDDSSIPRNRLQSVVVVPKN

STLPMEETSPCSSRSSQSYRHYSDHWEDERLESRRHLYEEKFESIASKACPQTDKFFLHKGTEKNPEISFT 

QSSRKQIDNRLPELSHPQSDGVDSTSHTDVKSDPLGHPNSEETVKAKIPSRQQEELPIYSSDFEDVPNKSW 

QQTTFQNRPDSRLGKTELSFSSSCEIPHVDGLHSSEELRNLGWDFSQEKPSTTYQQPDSSYGACGGHKYQQ 

NAEQYGGTRDYWQGNGYWDPRSGRPPGTGVVYDRTQGQVPDSLTDDREEEENWDQQDGSHFSDQSDKFLLS 

LQKDKGSVQAPEISSNSIKDTLAVNEKKDFSKNLEKNDIKDRGPLKKRRQEIESDSESDGELQDRKKVRVE 

VEQGETSVPPGSALVGPSCVMDDFRDPQRWKECAKQGKMPCYFDLIEENVYLTERKKNKSHRDIKRMQCEC 

TPLSKDERAQGEIACGEDCLNRLLMIECSSRCPNGDYCSNRRFQRKQHADVEVILTEKKGWGLRAAKDLPS 

NTFVLEYCGEVLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKWTVN 

GQLRVGFFTTKLVPSGSELTFDYQFQRYGKEAQKCFCGSANCRGYLGGENRVSIRAAGGKMKKERSRKKDS 

VDGELEALMENGEGLSDKNQVLSLSRLMVRIETLEQKLTCLELIQNTHSQSCLKSFLERHGLSLLWIWMAE 

LGDGRESNQKLQEEIIKTLEHLPIPTKNMLEESKVLPIIQRWSQTKTAVPPLSEGDGYSSENTSRAHTPLN 

TPDPSTKLSTEADTDTPKKLMFRRLKIISENSMDSAISDATSELEGKDGKEDLDQLENVPVEEEEELQSQQ 

LLPQQLPECKVDSETNIEASKLPTSEPEADAEIELKESNGTKLEEPINEETPSQDEEEGVSDVESERSQEQ 

PDKTVDISDLATKLLDSWKDLKEVYRIPKKSQTEKENTTTERGRDAVGFRDQTPAPKTPNRSRERDPDKQT 

QNKEKRKRRSSLSPPSSAYERGTKRPDDRYDTPTSKKKVRIKDRNKLSTEERRKLFEQEVAQREAQKQQQQ 

MQNLGMTSPLPYDSLGYNAPHHPFAGYPPGYPMQAYVDPSNPNAGKVLLPTPSMDPVCSPAPYDHAQPLVG 

HSTEPLSAPPPVPVVPHVAAPVEVSSSQYVAQSDGVVHQDSSVAVLPVPAPGPVQGQNYSVWDSNQQSVSV

QQQYSPAQSQATIYYQGQTCPTVYGVTSPYSQTTPPIVQSYAQPSLQYIQGQQIFTAHPQGVVVQPAAAVT

TIVAPGQPQPLQPSEMVVTNNLLDLPPPSPPKPKTIVLPPNWKTARDPEGKIYYYHVITRQTQWDPPTWES 

PGDDASLEHEAEMDLGTPTYDENPMKASKKPKTAEADTSSELAKKSKEVFRKEMSQFIVQCLNPYRKPDCK 

VGRITTTEDFKHLARKLTHGVMNKELKYCKNPEDLECNENVKHKTKEYIKKYMQKFGAVYKPKEDTELE 
Confidently predicted domains, repeats, motifs and features:
These features and domains are not shown in the diagram, either because their scores are
less significant than the required threshold, or because they overlap with some other
source of annotation:
Figure 31 LIN(n3628) Functional domains
Confidently predicted domains, repeats, motifs and features:
These features and domains are not shown in the diagram, either because their
scores are less significant than the required threshold, or because they overlap
with some other source of annotation:
Figure 32 KIAA 1732 Domains